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FIBER SHAFT MODIFICATIONS FOR EFFICIENT TARGETING 
RELATED APPLICATIONS 

Benefit of priority is claimed to U.S. provisional application Serial No. 
60/350,388, filed 24 January 2002, entitled "FIBER SHAFT MODIFICATIONS 
5 FOR EFFICIENT TARGETING," to Stevenson, Susan C, Kaleko, Michael, Smith, 
Theodore and Nemerow, Glen R., and to U.S. provisional application Serial No. 
60/391,967, filed 26 June 2002, entitled "FIBER SHAFT MODIFICATIONS FOR 
EFFICIENT TARGETING," to Stevenson, Susan C, Kaleko, Michael, Smith, 
Theodore and Nemerow, Glen R. This application is also related to International 

10 PCT application No. (attorney docket number 22908-1236), filed the same day 
herewith, entitled "FIBER SHAFT MODIFICATIONS FOR EFFICIENT 
TARGETING," to Stevenson, Susan C, Kaleko, Michael, Smith, Theodore and 
Nemerow, Glen R. Where permitted, the subject matter of each of these 
applications is incorporated by reference herein. 

15 FIELD OF INVENTION 

The present invention generally relates to the field of adenoviral vectors 
and the production of such vectors. In particular, detargeted adenoviral vectors 
are provided. 
BACKGROUND 

20 Most, if not all, adenoviral vector-mediated gene therapy strategies aim to 

transduce a specific tissue, such as a tumor or an organ. Systemic delivery will 
require ablation of the normal virus tropism as well as addition of new 
specificities. Multiple interactions between adenoviral particles and the host cell 
are required to promote efficient cell entry (Nemerow (2000) Virology 274:1-4). 

25 An adenovirus entry pathway is believed to involve two separate cell surface 
events. First, a high affinity interaction between the adenoviral fiber knob and 
coxsackie-adenovirus receptor (CAR) mediates the attachment of the adenovirus 
particle to the cell surface. A subsequent association of penton with the cell 
surface integrins aj? 3 and ajff 5 , which act as co-receptors, potentiates virus 

30 internalization. There are a plurality of adenoviral fiber receptors, which interact 
with the group B {e.g., Ad3) and group C (e.g., Ad5) adenoviruses. Both of 
these groups of adenoviruses appear to require interaction with integrins for 
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internalization. CAR ablation, however, does not change biodistribution and 
toxicity of adenoviral vectors in vivo (Alemany et af. (2001) Gene Therapy 
5:1347-1353; U.S. patent application No. 09/870,203, filed May 30, 2001, and 
published as U.S. Published application No. 20020137213). Thus, the role of 
5 CAR interaction for in vivo gene transfer is not clear. Recently published studies 
have described conflicting results (Alemany et at. (2001) Gene Therapy 
3:1347-1353; Leissner et aL (2001) Gene Therapy 5:49-57; Einfeld eta/. (2001) 
J. Virology 75:1 1 284-1 1 291 ). For example/it has been shown that vectors 
containing an S408E mutation in the Ad5 fiber AB loop yield efficient liver 
10 transduction in mice, despite having greatly reduced transduction efficiencies on 
cells in culture (see, Leissner et aL (2001) Gene Therapy 5:49-57). In contrast, 
vectors containing a more extensive fiber AB loop mutation showed a 10-fold 
reduction in liver gene expression (see, Einfeld et aL (2001) J. Virology 
75:11284-11291). 

15 A doubly ablated adenovirus has been prepared by modifying the CAR 

binding region in the fiber loop and the integrin binding region in the penton base 
(Einfeld et aL (2001 ) J. Virology 75\\ 1 284-1 1 291 ): This doubly ablated 
adenovirus, lacking CAR and integrin interactions, was reported not only to lack 
in vitro transduction of various cell types but also to lack in vivo transduction of 

20 liver cells. Specifically, the doubly ablated adenovirus was reported to have a 
700 fold reduction in liver transduction when compared to the non-ablated 
adenovirus. These results, however, were not reproduced by others. 

For many applications, the most clinically useful adenoviral vector would 
be deliverable systemically, such as into a peripheral vein, and would be targeted 

25 to a desired location in the body, and would not have undesirable side effects 
resulting from targeting to other locations. In vivo adenoviral vector targeting is 
a major goal in gene therapy and a significant effort has been focused on 
developing strategies to achieve this goal. Successful targeting strategies would 
direct the entire vector dose to the appropriate site and would be likely to 

30 improve the safety profile of the vector by permitting the use of lower, less toxic 
vector doses, which potentially also can be less immunogenic. Thus, there is a 
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need to develop adenoviruses which are fully detargeted in vivo for use as a 
base vector for producing redirected adenoviruses. 

Therefore, among the objects herein, it is an object herein to provide fully 
detargeted adenoviral vectors, methods for preparation thereof, and uses 
5 thereof. 
SUMMARY 

Detargeted and fully detargeted adenoviral particles, adenovirus vectors 
from which such particles are produced, methods for preparation of the vectors 
and particles and uses of the vectors and particles are provided. Provided and 

10 described are capsid modifications, such as fiber shaft modifications, and the 
resuling proteins that, when expressed on adenoviral particles provide for 
detargeting of adenoviral vectors. The capsid modifications, such as the fiber 
shaft modifications, can be combined with other modifications, such as fiber 
knob and/or penton modifications, to produce fully ablated (detargeted) 

15 adenoviral particles. Thus, adenoviral vectors and adenovirual particles whose 
native tropisms are ablated through a modification or modifications of capsid 
proteins, particularly a fiber shaft region, are provided. 

Thus, provided are capsid mutiations, including fiber shaft modifications, 
that ablate binding to particular receptors, thereby permitting efficient targeting 

20 of adenoviral vectors that contain capsids with such modifications. For example, 
adenoviral vectors in which the fiber shaft's interaction with HSP is ablated 
(reduced or substantially eliminated), particuarly in vivo, are provided. These 
fiber shaft modifications can be combined with other modifications, such as fiber 
knob and/or penton modifications, to produce fully ablated (detargeted) 

25 adenoviral vectors. Also provided are retargeted vectors and particles that 

include a ligand or ligands to provide for targeting of the detargeted vectors and 
particles to selected cells and/or tissues. Retargeting can be effected, for 
example, by manipulating the fiber protein to redirect the receptor specificity to a 
particular cell type. 

30 Also provided are nucleic acids encoding the modified fiber proteins and 

also modified penton proteins. Also provided are nucleic acids encoding the 
modified fiber shaft protein that has ablated HSP binding and combinations 
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thereof with other modified fiber regions or other proteins, such as a modified 
fiber knob region and/or the modified penton protein. The nucleic acids also can 
contain heterologous nucleic acid sequences, such as promoters or nucleic acid 
sequences encoding polypeptides. The viral particles that express fibers 
5 containing such shaft modifications and other modifications are also provided. 

Also provided are methods for making and using the adenoviral particles 
that express the modified fibers and combinations of modified fibers and 
modified penton. With the fiber shaft modifications, particularly in combination 
with the fiber knob modifications and the penton modifications, the adenovirus 

10 particles are ablated for binding to their natural cellular receptors), i.e., they are 
detargeted. They can then be "retargeted" to a specific cell type through the 
addition of a ligand to the virus capsid, which causes the virus to bind to and 
infect such cell. The ligand can be added, for example, through genetic 
modification of a capsid protein gene. 

15 Also provided is a method for reducing liver toxicity in adeno- 

viral-mediated therapy. In contrast to the results of Einfeld et al. (Einfeld et aL 
(2001 ) J. Virology 75:1 1284-1 1 291), it is shown herein that a doubly ablated 
adenovirus, lacking CAR and integrin interactions, is capable of in vivo liver 
transduction. It is shown herein that ablation of liver transduction requires 

20 further and/or alternative modification(s). The method for reducing liver toxicity 
in adenoviral-mediated therapy includes 

modifying an adenoviral vector to ablate native tropism to liver cells in vivo. 
Such vector can be administered to a subject. The modifications include the 
modifications described herein. 

25 The nucleic acids, proteins, adenoviral particles and adenoviral vectors 

have a variety of uses. These include in vivo and in vitro uses to target nucleic 
acid to particular cells and tissues, for therapeutic purposes, including gene 
therapy, and also for the identification and study of cell surface receptors and 
identification of modes of interaction of viruses with cells. 

30 In particular, adenoviral fiber shaft modifications that ablate viral 

interaction with HSP (Heparin Sulfate Proteoglycans; also referred to as heparin 
sulfate glycosaminoglycans) are provided. These modifications include 
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mutations of individual amino acids in the fiber shaft that interact with HSP or 
mutations of amino acids in the fiber shaft that modify the ability of the HSP 
binding motif to interact with HSP. Adenoviral fiber shaft modifications also 
include replacements of fiber shafts using fiber shafts of adenoviruses, such as, 
5 for example, Ad3, Ad35 and Ad41 short fiber shaft, that do not contain HSP 
binding sites. 

Also provided are adenoviral fiber shaft modifications that alter, 
particularly ablate viral interaction with HSP, as described above, in combination 
with fiber knob modifications that ablate viral interaction with CAR. The fiber 

10 knob modifications include: (a) mutations of individual amino acids in the fiber 
loop that interact with CAR, such as, for example, AB or CD loop modifications; 
(b) mutations of individual amino acids in the fiber loop that modify the ability of 
the CAR binding motif to interact with CAR; and (c) replacements of fiber knobs 
using adenoviruses that do not interact with CAR, such as, for example, Ad3 

15 fiber knob, Ad41 short fiber knob, or Ad35 fiber knob. 

Also provided are adenoviral fiber shaft modifications as described above 
in combination with penton modifications that ablate viral interaction with a v 
integrins. The penton modifications include: (a) mutations of individual amino 
acids that interact with cr v integrins; (b) mutations of individual amino acids that 

20 modify the ability of the a v integrin binding motif to interact with the a v integrins; 
and (c) replacement of penton proteins using penton proteins from adenoviruses 
that do not interact with the a v integrins. 

Also provided are adenoviral fiber shaft modifications as described above 
in combination with fiber knob modifications as described above and penton . 

25 modifications as described above. 

Also provided is a scale-up method for the propagation of detargeted 
adenoviral vectors. The method uses polycations and/or bifunctional reagents, 
which when added to tissue culture medium results in entry of adenoviral 
particles into the producer cells. 
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Provided aer recombinant viral particles that contain a modified capid 
protein whereby binding to heparin sulfate proteoglycans (HSP) is reduced or 
eliminated compared to particles that contain unmodified capsid proteins. The 
modified capsid proteins include fibers proteins with modified shafts such that 
5 binding to HSP is reduced or eliminated. 

Among the particular embodiments the following are provided. Provided 
are adenovirus capsid proteins that are modified to alter, typically reduce or 
eliminate, binding to or interaction with in vivo and/or in vitro to heparin sulfate 
proteoglycan (HSP). HSPs are expressed on various cells, including 
10 hepatocytes. It is shown herein that HSPs provide for or participate in 

transduction of cells, such as liver cells. Since it can be desirable to eliminate or 
reduce such transduction, the modifications of the capsid proteins, such as fiber 
proteins, permit detargeting of particles that express such proteins from such 
cells. 

15 Thus provided are modified adenovirus fiber proteins that include a 

mutation, such as an insertion, deletion, change, replacement of amino acids or 
combinations thereof, whereby binding to or interaction with heparin sulfate 
proteoglycan (HSP) is altered. In particular, the he binding of the modified fiber 
protein is eliminated or reduced compared to the unmodified protein. Exemplary 

20 of these mutations are mutations in the shaft of a fiber, where the shaft also can 
include the tail. The mutations can reduce or alter the affinity of the fiber 
protein for HSP is reduced at least by 2-fold, 5-fold, 10-fold, 100-fold or more, 
including substantially eliminating it. 

As provided herein, fibers from adenoviruses that interact with HSP can 

25 include a motif, such as BBXB or BBBXXB, wherethe B is a basic amino acid and 
X is any amino acid, particularly the consensus sequence KKTK in Ad5 and Ad2. 
Thus, provided are fibers in which the motif is altered to eliminate or reduce 
interaction with HSP. 

Also provided are modified fiber protein of claim 1 that are chimeras in 

30 which the fiber shaft (or fiber shaft and tail) are derived from a fiber, such as 
Ad3, Ad35, Ad7, Adl 1, Ad16, Ad21, Ad34, Ad40, Ad41 or Ad46 fiber, that 
does not interact with HSP and combined with fiber that does interact, such as 
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Ad5 or Ad2 fiber, to produce a complete fiber whose binding to HSP is reduced 
or eliminated. 

All of the modified capsids proteins provided herein also can include one 
or more further modifications that reduce or eliminate interaction of the resulting 
5 fiber with one or more cell surface proteins, such as but not limited to, CAR and 
a v integrin or other receptor to which a particular native fiber binds, in addition to 
HSP. These modifications include, but are not limited to, modification to fiber 
that reduces or eliminates CAR binding and modification to penton that reduces 
or eliminate a v integrin binding. The mutations can be in the fiber knob, shaft, 

10 tail and shaft, and also in penton. 

Any and all of the modified capsid proteins provided herein can further 
include a Jigand that binds to a particular receptor thereby endowing a fiber (or 
other capsid protein) with binding specificity or the ability to interact with such 
receptor. The ligand can be inserted into any suitable site in a capsid protein, 

15 such as an insertion or replacement. For example, fibers with ligands inserted 
into the knob region are exemplified. Any such ligand can be employed and a 
variety are exemplified herein. 

A variety of modified capsid proteins are exemplified herein. These 
include, but are not limited to, fibers containing: the sequence of amino acids 

20 set forth in any of SEQ ID Nos. 52, 54, 56, 58, 62, 66, 7.0 and 72; or a 
sequence of amino acids having 60%, 70%, 80%, 90%, 95% or greater 
sequence identity with a sequence of amino acids set forth in any of SEQ ID 
Nos. 52, 54, 56, 58, 62, 66, 70 and 72; or a sequence of amino acids encoded 
by a sequence of nucleotides that hybridizes under conditions of high stringency 

25 along at least 70% of its length to a sequence of nucleotides that encodes a 

sequence of amino acids set forth in any of SEQ ID Nos. 52, 54, 56, 58, 62, 66, 
70 and 72. 

Nucleic acids encoding the capsid proteins, including the fibers are also 
provided. The nucleic acids can be provided as vectors, particularly as 
30 adenovirus vectors. Many adenoviral vectors are known and can be modified as 
needed in accord with the description herein. Adenoviral vectors include, but 
are not limited to, early generation adenoviral vectors, such as E1 -deleted 



WO 03/062400 PCT/US03/02295 



-8- 

vectors, gutless adenoviral vectors and replication-conditional adenoviral vectors, 
such as oncolytic adenoviral vectors. The adenovirus vectors also can include 
heterologous nucleic acids that encode or provide products, such as therapeutic 
products. Any therapeutic product is contemplated and a variety are set forth 
5 herein as exemplary. Heterologous nucleic acid can encode a polypeptide or 
comprise or encode a regulatory sequence, such as a promoter or an RNA, 
including RNAi, small RNAs, other double-stranded RNAs, antisense RNA, and 
ribozymes. Promoters include, for example, constitutive and regulated 
promoters and tissue specific promoter, including tumor specific promoters. 

10 The promoter can be operably linked, for example, to a gene of an adenovirus 
essential for replication. 

Cells containing the nucleic acid molecules and cells containing the 
vectors are also provided. Such cell include packaging cells. The cells can be 
prokaryotic or eukaryotic cells, including, mammalian cells, such a primate cells, 

15 including human cells. 

Also provided are adenoviral particles that contain the modified capsid 
proteins provided herein. The particles have altered interaction or binding with 
HSP compared to particles that do not contain the modified capsid proteins. In 
addition to altered binding to HSP, which is typically reduced or eliminated 

20 binding, the particles can include further modifications, such as capsid proteins 
with altered interaction with other receptors as described above. In particular, 
the particles can have altered, typically reduced or eliminated, interaction with 
CAR, a v integrin and/or other receptors. The mutation include mutations in the 
fiber knob, penton and hexon. Exemplary fiber know mutations are mutations in 

25 the AB loop or CD loop, such as KOI or K012, which are described herein. In 
addition, the particles can include additional ligands for retargeting to selected 
receptors. The adenoviral particles can be from any serotype and subgroup. 

Methods for expressing heterologous nucleic acids in a cell are provided. 
In these methods an adenoviral vector provided herein is transduced into a cell 

30 to deliver the nucleic acid and/or encoded products. Transduction can be 
effected in vivo or in vitro or ex vivo, and can be for a variety of purposes 
including study of gene expression and genetic therapy. The cells can be 
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prokaryotic cells, but typically are eukaryotic cells, including mammalian cells, 
such as primate, including human, cells. The cells can be of a specific type, such 
as a tumor cell or a cell in a particular tissue. The vectors can be oncolytic 
vector to effect killing of tumor cells. 
5 Since the modified capsid proteins herein have reduced or eliminated 

binding to HSP, viral particles containing such proteins exhibit ablated binding to 
HSP in vitro and in vivo. Thus provided is a method of reducing transduction of 
cells that express HSP, such as hepatocytes in the liver, by modifying a capsid 
protein, such as fiber to eliminate or reduce interaction with or binding to HSP. 

10 Such reduction reduces or eliminates transduction of cells that express HSP, 
including liver cells. 

Also provided are scale-up methods for the propagation of detargeted 
adenoviral particle, such as those provided herein. The method includes the 
steps of infecting or transducing a cell capable of replicating, maturing and 

15 packaging an adenoviral vector with a detargeted adenoviral vector in the 

presence of a reagent that results in entry of the adenoviral particle into the cell, 
such as a polycation and/or a bif unctional protein' or other such reagent; and 
culturing the infected cell under conditions suitable for growth, spread and 
propagation of the adenoviral vector. The resulting adenoviral particles can be 

20 recovered. Polycations include, but are not limited to, hexadimethrine bromide, 
polyethylenimine, protamine sulfate and poIy-L-lysine. Bifunctional proteins, 
include, but are not limited to, an anti-fiber antibody ligand fusion, an 
anti-fiber-Fab-FGF conjugate, an anti-penton-antibody ligand fusion, an 
anti-hexon antibody ligand fusion and a polylysine-peptide fusion. The ligand is 

25 selected to bind to a particular receptor. 

The viral particles that express a modified capsids provided herein can be 
produced by this method. The modification include, for example, one or more 
mutations selected from among mutations that reduce or eliminate interactions 
with one or more of a v integrins, coxsackie-adenovirus receptors (CAR) and 

30 heparin sulfate proteoglycans (HSP). Such mutations include, for example, PD1, 
K01, K012 and S*. 
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BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a plasmid map for pSKOI . 

Figure 2 is a plasmid map for pNDSQ3.1 KOI . 

Figures 3A-3C are plasmid maps of pAdmireRSVnBg.(Fig. 3A), pSQ1 Fig. 
5 3B) and pSQ1KO!2 (Fig. 3C) 

Figure 4 is a plasmid map for pSQ1PD1 . 

Figures 5A-5B are plasmid maps of pSQ1FK01PD1 (Fig. 5A) and 
pSQ1KO!2PD1 (Fig. 5B). 

Figure 6 shows in vitro transduction efficiency of A549 cells using 
10 adenoviral vectors containing fiber AB loop knob and/or penton, PD1 mutations. 
The following adenoviral vectors were used in these studies: AvInBg, 
AvInBgFKOI , referred to as FKOI, Av1 nBgPDI , referred to as PD1, and 
Av1 nBgFKOI PD1 that is referred to as FK01PD1 . 

Figure 7A-7B shows in vivo adenoviral-mediated liver gene expression 
15 (Fig. 7A) and hexon DNA content (Fig, 7B) using adenoviral vectors containing 
fiber AB loop knob and/or penton, PD1 mutations. The following adenoviral 
vectors were used in these studies: AvInBg, AvInBgFKOI, referred to as 
FKOI, Av1nBgPD1, referred to as PD1 , Av1 nBgFKOI PD1 , referred to as 
FK01PD1 , AvInBgKOI 2, referred to as K012, and Av1 nBgKOI 2PD1 that is 
20 referred to as KOI 2PD1 . 



Figures 13A-13C show the transduction efficiency of A549 and HeLa 
cells using adenoviral vectors containing fiber shaft, knob and/or penton 
mutations. Fig. 13A shows the dose response for the transduction efficiency of 
A549 cells. Fig. 13B shows the transduction efficiency of HeLa cells at 2000 
30 ppc. Figure 13C shows the competition analysis of adenoviral vectors 
containing fiber shaft mutations. 



25 



Figure 8 is a plasmid map for pFBshuttle(EcoRI). 
Figure 9 is a plasmid map for pSQ1 HSP. 
Figure 10 is a plasmid map for pSQIHSPKOI. 
Figure 1 1 is a plasmid map for pSQ1HSPPD1. 
Figure 12 is a plasmid map for pSQ1 HSPKOl PD1 . 



WO 03/062400 



PCT/US03/02295 



-11- 

Figures 14A-14B shows the influence of fiber shaft mutations on in vivo 
adenoviral-mediated liver gene expression (Fig. 14A) and hexon DNA content 
{Fig. 14B). 

Figures 15A-15B are plasmid maps of pSQIHSPRGD (Fig. 15A) and 
pSQIHSPKOIRGD (Fig. 15B). 

Figure 1 6 shows that insertion of a RGD targeting ligand can restore 
transduction of the vectors containing the HSP binding shaft S* mutation. 

Figures 17A-17B are plasmid maps of pSQ1 AD35Fiber (Fig. 17A) and 
pSQ1Ad35FcRGD (Fig. 17B). 

Figures 18A-18B are maps of plasmids encoding 35F chimeric fibers. Fig. 
18A is a plasmid map of pSQ135T5H, and Fig. 18B is a plasmid map of 
pSQ15T35H. 

Figure 1 9 shows the results of an in vitro analysis of Ad5 vectors 
containing Ad35 fibers and derivatives thereof. 

Figure 20 shows the results of an in vivo analysis of Ad5 vectors 
containing Ad35 fibers and derivatives thereof. 

Figures 21A-21B are plasmid maps of pSQ1Ad41sF (Fig. 21 A) and 
pSQ1 Ad41 sFRGD (Fig. 21 B). 

Figure 22 shows the results of an in vivo analysis of Ad 5 vectors 
containing Ad41 short fiber. 

Figure 23 shows the in vitro analysis of Ad5 based vectors containing the 
Ad41 short fiber which has been re-engineered to contain a cRGD ligand in the 
HI loop. 

Figure 24 shows enhanced transduction of AE1-2a cells with the 
Av3nBgFKO! detargeted adenoviral vector using hexadimethrine bromide (HB), 
protamine sulfate (PS) and poly-lysine-RGD (K14) or the anti-penton-TNFa 
bifunctional protein (open-TNF). 

Figure 25 shows ablation of HSP interaction decreases 
adenoviral-mediated gene transfer to other organs 

Figure 26 shows in vivo liver transduction with adenoviral vectors which 
encode for B-galactosidase and contain various mutations to the fiber and/or 
penton proteins. Results are plotted as percent transduction as compared to 
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wild type. Two different methods for determining the level of transduction are 
shown for each vector. 

Figure 27 shows the adenoviral vector biodistribution to the liver and 
tumor for the vectors containing the S*, KOIS*, and 41sF fibers. 
5 DETAILED DESCRIPTION 
A. DEFINITIONS 

Unless defined otherwise, all technical and scientific terms used herein 
have the same meaning as is commonly understood by one of skill in the art to 
which the invention(s) belong. All patents, patent applications, published 

10 applications and publications, Genbank sequences, websites and other published 
materials referred to throughout the entire disclosure herein, unless noted 
otherwise, are incorporated by reference in their entirety. In the event that there 
are a plurality of definitions for terms herein, those in this section prevail. 
Where reference is made to a URL or other such identifier or address, it 

15 understood that such identifiers can change and particular information on the 
internet can come and go, but equivalent information is known and can be 
readily accessed, such as by searching the internet and/or appropriate 
databases. Reference thereto evidences the availability and public dissemination 
of such information. 

20 As used herein, the term "adenovirus" or "adenoviral particle" is used to 

include any and all viruses that can be categorized as an adenovirus, including 
any adenovirus that infects a human or an animal, including all groups, 
subgroups, and serotypes. Depending upon the context reference to 
"adenovirus" can include adenoviral vectors. There are at least 51 serotypes of 

25 Adenovirus that classified into several subgroups. For example, subgroup A 

includes adenovirus serotypes 12, 18, and 31. Subgroup C includes adenovirus 
serotypes 1, 2, 5, and 6. Subgroup D includes adenovirus serotype 8, 9, 10, 13, 
15, 17, 19, 20, 22-30, 32, 33, 36-39, and 42-49. Subgroup E includes 
adenovirus serotype 4. Subgroup F includes adenovirus serotypes 40 and 41 . 

30 These latter two serotypes have a long and a short fiber protein. Thus, as 
used herein an adenovirus or adenovirus particle is a packaged vector or 
genome. 
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As used herein, "virus," "viral particle," "vector particle," "viral vector 
particle," and "virion" are used interchangeably to refer to infectious viral 
particles that are formed when, such as when a vector containing all or a part of 
a viral genome, is transduced into an appropriate cell or cell line for the 
5 generation of such particles. The resulting viral particles have a variety of uses, 
including, but not limited to, transferring nucleic acids into cells either in vitro or 
in vivo. For purposes herein, the viruses are adenoviruses, including recombinant 
adenoviruses formed when an adenovirus vector, such as any provided herein, is 
encapsulated in an adenovirus capsid. Thus, a viral particle is a packaged viral 

10 genome. An adenovirus viral particle is the minimal structural or functional unit 
of a virus. A virus can refer to a single particle, a stock of particles or a viral 
genome. The adenovirus (Ad) particle is relatively complex and may be resolved 
into various substructures. 

Included among adenoviruses and adenoviral particles are any and all 

15 viruses that can be categorized as an adenovirus, including any adenovirus that 
infects a human or an animal, including all groups, subgroups, and serotypes. 
Thus, as used herein, "adenovirus" and "adenovirus particle" refer to the virus 
itself and derivatives thereof and cover all serotypes and subtypes and naturally 
occurring and recombinant forms, except where indicated otherwise. Included 

20 are adenoviruses that infect human cells. Adenoviruses can be wildtype or can 
be modified in various ways known in the art or as disclosed herein. Such 
modifications include, but are not limited to, modifications to the adenovirus 
genome that is packaged in the particle in order to make an infectious virus. 
Exemplary modifications include deletions known in the art, such as deletions in 

25 one or more of the E1a, E1b, E2a, E2b, E3, or E4 coding regions. Other 

exemplary modifications include deletions of all of the coding regions of the 
adenoviral genome. Such adenoviruses are known as "gutless" adenoviruses. 
The terms also include replication-conditional adenoviruses, which are viruses 
that preferentially replicate in certain types of cells or tissues but to a lesser 

30 degree or not at all in other types. For example, among the adenoviral particles 
provided herein, are adenoviral particles that replicate in abnormally proliferating 
tissue, such as solid tumors and other neoplasms. These include the viruses 
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disclosed in U.S. Patent No. 5,998,205 and U.S. Patent No. 5,801,029. Such 
viruses are sometimes referred to as "cytolytic" or "cytopathic" viruses (or 
vectors), and, if they have such an effect on neoplastic cells, are referred to as 
"oncolytic" viruses (or vectors). 
5 As used herein, the terms "vector," "polynucleotide vector," 

"polynucleotide vector construct," "nucleic acid vector construct," and "vector 
construct" are used interchangeably herein to mean any nucleic acid construct 
that can be used for gene transfer, as understood by those skilled in the art. 
As used herein, the term "viral vector" is used according to its 

10 art-recognized meaning. It refers to a nucleic acid vector construct that includes 
at least one element of viral origin and can be packaged into a viral vector 
particle. The viral vector particles can be used for the purpose of transferring 
DNA, RNA or other nucleic acids into cells either in vitro or in vivo. Viral vectors 
include, but are not limited to, retroviral vectors, vaccinia vectors, lentiviral 

15 vectors, herpes virus vectors (e.g., HSV), baculoviral vectors, cytomegalovirus 
(CMV) vectors, papillomavirus vectors, simian virus (SV40) vectors, Sindbis 
vectors, semliki forest virus vectors, phage vectors, adenoviral vectors, and 
adeno-associated viral (AAV) vectors. Suitable viral vectors are described, for 
example, in U.S. Patent Nos. 6,057,155, 5,543,328 and 5,756,086. The 

20 vectors provided herein are adenoviral vectors. 

As used herein, "adenovirus vector" and "adenoviral vector" are used 
interchangeably and are well understood in the art to mean a polynucleotide 
containing all or a portion of an adenovirus genome. An adenoviral vector, refers 
to nucleic encoding a complete genome or a modified genome or one that can be 

25 used to introduce heterologous nucleic acid when transferred into a cell, 

particularly when packaged as a particle. An adenoviral vector can be in any of 
severa | forms, including, but not limited to, naked DNA, DNA encapsulated in an 
adenovirus capsid, DNA packaged in another viral or viral-like form (such as 
herpes simplex, and AAV), DNA encapsulated in liposomes, DNA complexed 

30 with polylysine, complexed with synthetic polycationic molecules, conjugated 
with transferrin, complexed with compounds such as PEG to immunologically 
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"mask" the molecule and/or increase half-life, or conjugated to a non-viral 
protein. 

As used herein, oncolytic adenoviruses refer to adenoviruses that 
replicate selectively in tumor cells 
5 As used herein, a variety of vectors with different requirements and 

purposes are described. For example, one vector is used to deliver particular 
nucleic acid molecules into a packaging cell line for stable integration into a 
chromosome. These types of vectors also are referred to as complementing 
plasmids. A further type of vector carries or delivers nucleic acid molecules in or 

10 into a cell line {e.g., a packaging cell line) for the purpose of propagating viral 

vectors; hence, these vectors also can be referred to herein as delivery plasmids. 
A third "type" of vector is the vector that is in the form of a virus particle 
encapsulating a viral nucleic acid and that is comprised of the capsid modified as 
provided herein. Such vectors also can contain heterologous nucleic acid 

15 molecules encoding particular polypeptides, such as therapeutic polypeptides or 
regulatory proteins or regulatory sequences to specific cells or cell types in a 
subject in need of treatment. 

As used herein, the term "motif" is used to refer to any set of amino 
acids forming part of a primary sequence of a protein, either contiguous or 

20 capable of being aligned to certain positions that are invariant or conserved, that 
is associated with a particular function. The motif can occur, not only by virture 
of the primary sequence, but also as a consequence of three-dimensional folding. 
For example, the motif GXGXXG is associated with nucleotide-binding sites. In 
this fiber is a trimer, hence the trimeric structure can contribute fornation of a 

25 motif. Alternatively, a motif can be considered as a domain of a protein, where 
domain is a region of a protein molecule delimited on the basis of function 
without knowledge of and relation to the molecular substructure, as, e.g., the 
part of a protein molecule that binds to a receptor. As shown herein, the motif 
KKTK constitutes a consensus sequence for fiber shaft interaction with HSP. 

30 As used herein, the term "bind" or "binding" is used to refer to the 

binding between a ligand and its receptor, such as the binding of an Ad5 shaft 
motif with HSP (Heparin Sulfate Proteoglycans), with a K d in the range of 10-2 



WO 03/062400 PCT/US03/02295 



■16- 



V15 



to 10-15 mole/l, generally, 10 6 to 10' 15 , 10" 7 to 10 15 and typically 10 8 to 10" 
(and/or a K a of 10 5 -10 12 , 10 7 -10 12 , 10 8 -10 12 l/mole). 

As used herein, specific binding or selective binding means that a the 
binding of a particular ligand and one receptor interaction (k a or K^) is at least 2- 
5 fold, generally, 5, 10, 50, 100 or more-fold, greater than for another receptor. 
A statement that a particular viral vector is targeted to a cell or tissue means 
that its affinity for such cell or tissue in a host or in vitro is at least about 2-fold, 
generally, 5, 10, 50, 100 or more-fold, greater than for other cells and tissues in 
the host or under the in vitro conditions. 

10 As used herein, the term "ablate" or "ablated" is used to refer to an 

adenovirus, adenoviral vector or adenoviral particle, in which the ability to bind 
to a particular cellular receptor is reduced or eliminated, generally substantially 
eliminated (i.e.. reduced more than 10-fold, 100-fold or more) when compared to 
a coresponding wild-type adenovirus. An ablated adenovirus, adenoviral vector 

15 or adenoviral particle also is said to be detargeted, i.e., the modified adenovirus, 
adenoviral vector or adenoviral particle does not possess the native tropism of 
the wild-type adenovirus. The reduction or elimination of the ability of the 
mutated adenovirus fiber protein and/or mutated adenovirus penton protein to 
bind a cellular receptor as compared to the corresponding wild-type fiber protein 

20 and/or wild-type penton protein can be measured or assessed by comparing the 
transduction efficiency (gene transfer and expression of a marker gene) of an 
adenovirus particle containing the mutated fiber protein and/or mutated penton 
protein compared to an adenovirus particle containing the wild-type fiber protein 
and/or wild-type penton protein for cells having the cellular receptor. 

25 As used herein, tropism with reference to an adenovirus refers to the 

selective infectivity or binding that is conferred on the particle by a capsid 
protein, such as the fiber protein and/or penton. 

As used herein, "penton" or "penton complex" is used herein to designate 
a complex of penton base and fiber. The term "penton" can also be used to 

30 indicate penton base, as well as penton complex. The meaning of the term 
"penton" alone should be clear from the context within which it is used. 
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As used herein, the term "substantially eliminated" refers to a 
transducticm efficiency less than about 1 1 % of the efficiency of the wild-type 
fiber containing virus on HeLa cells. The transduction efficiency on Hela cells 
can be measured (see, e.g., Example 1 of U.S. Patent Application Serial No. 
5 09/870,203 filed on 30 May 2001, and published as U.S. Published application 
No. 20020137213, and of International Patent Application No. PCT/EP0 1/06 286 
filed 1 June 2001). Briefly, HeLa ceils are infected with the adenoviral vectors 
containing mutated fiber proteins to evaluate the effects of fiber amino acid 
mutations on CAR interaction and subsequent gene expression. Monolayers of 

10 HeLa cells in 12 well dishes are infected with, for example, 1000 particles per 
cell for 2 hours at 37° C. in a total volume of, for example, 0.35 ml of the 
DMEM containing 2% FBS. The infection medium is then aspirated from the 
monolayers and I ml of complete DMEM containing 10% FBS was added per 
well. The cells are incubated for an sufficient time, generally about 24 hours, to 

1 5 allow for /?-galactosidase expression, which is measured by a chemiluminescence 
reporter assay and by histochemical staining with a chromogenic substrate. The 
relative levels of ^-galactosidase activity are determined using as suitable 
system, such as the Galacto-Light chemiluminescence reporter assay system 
(Tropix, Bedford, Mass.) Cell monolayers are washed with PBS and processed 

20 according to the manufacturer's protocol. The cell homogenate is transferred to 
a microfuge tube and centrifuged to remove cellular debris. Total protein 
concentration is determined, such as by using the bicinchoninic acid(BCA) 
protein assay (Pierce, Inc., Rockford, III.) with bovine serum albumin as the 
assay standard. An aliquot of each sample is then incubated with the Tropix 

25 jff-galactosidase substrate for 45 minutes in a 96 well plate. A luminometer is 
used determine the relative light units (RLU) emitted per sample and then 
normalized for the amount of total protein in each sample (RLU/ug total protein). 
For the histochemical staining procedure, the cell monolayers are fixed with 
0.5% glutaraldehyde in PBS, and then were incubated with a mixture of 1 mg of 

30 5-bromo-4-chloro-3-indolyh9-D-galactoside (X-gal) per ml, 5 mM potassium 

ferrocyanide, 5 mM potassium ferricyanide and 2 mM MgCI 2 in 0.5 ml of PBS. 
The monolayers are washed with PBS and the blue cells are visualized by light 
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microscopy, such as with a Zeiss ID03 microscope. Generally, the efficiency is 
less than about 9%, and typically is less than about 8%. 

As used herein, the phrase "reduce" or "reduction" refers to a change in 
the efficiency of transduction by the adenovirus containing the mutated fiber as 
5 compared to the adenovirus containing the wild-type fiber to a level of about 
75% or less of the wild-type on HeLa cells. Generally, the change in efficiency 
is to a level of about 65% or less than wild-type. Typically it is about 55% or 
less. This system is able to rapidly analyze modified fiber proteins and/or 
modified penton proteins for desired tropism in the context of the viral particle. . 

10 As used herein, the term "mutate" or "mutation" or similar terms refers to 

the deletion, insertion or change of at least one amino acid in the part of the 
fiber shaft region interacting with HSP. The amino acid can be changed by 
substitution or by modification in a way that derivatizes the amino acid. Thus, 
for example a BBXB motif or BBBXXB motif, where B is a basic amino acid, in an 

15 adenovirus is mutated to ablate the viral interaction with HSP. 

As used herein, the term "polynucleotide" means a nucleic acid molecule, 
such as DNA or RNA, that encodes a polynucleotide. The molecule can include 
regulatory sequences, and is generally DNA. Such polynucleotides are prepared 
or obtained by techniques known by those skilled in the art in combination with 

20 the teachings contained therein. 

As used herein, adenoviral genome is intended to include any adenoviral 
vector or any nucleic acid sequence comprising a modified fiber protein. All 
adenovirus serotypes are contemplated for use in the vectors and methods 
herein. 

25 As used herein, the term "viral vector" is used according to its art- 

recognized meaning. It refers to a nucleic acid vector construct that includes at 
least one element of viral origin and can be packaged into a viral vector particle. 
The viral vector particles can be used, for example, for transferring DNA into 
cells either in vitro or in vivo. 

30 As used herein, a packaging cell line is a cell line that is able to package 

adenoviral genomes or modified genomes to produce viral particles. It can 
provide a missing gene product or its equivalent. Thus, packaging cells can 
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provide complementing functions for the genes deleted in an adenoviral genome 
(e.g., the nucleic acids encoding modified fiber proteins) and are able to package 
the adenoviral genomes into the adenovirus particle. The production of such 
particles require that the genome be replicated and that those proteins necessary 
5 for assembling an infectious virus are produced. The particles also can require 
certain proteins necessary for the maturation of the viral particle. Such proteins 
can be provided by the vector or by the packaging cell. 

As used herein, detargeted adenoviral particles have ablated (reduced or 4 
eliminated) interaction with receptors with which native particles. Fully 

10 detargeted particles have two or more specificities altered. It is understood that 
in vivo no particles are fully ablated such that they do not interact with any cells. 
Degareted and fully degarated have reduced, typically substantial! reduced, or 
eliminated interaction with native receptors. For purposes herein, detargeted 
particles have reduced (2-fold, 5-fold, 10-fold, 100-fold or more) binding or 

1 5 virtually no binding to HSP receptors; fully degareted vectors include further 

capsid modifications to eliminate interactions with other receptors, such as CAR 
and integrins or other receptors. The particles still bind to cells, but the types of 
cells and interactions are reduced. 

As used herein, pseudotyping describes the production of adenoviral 

20 vectors having modified capsid protein or capsid proteins from a different 

serotype than the serotype of the vector itself. One example, is the production 
of an adenovirus 5 vector particle containing an Ad37 or Ad35 fiber protein. 
This can be accomplished by producing the adenoviral vector in packaging cell 
lines expressing different fiber proteins. As provided herein, detargeting of an 

25 adenovirus 5 particle or other serotype group C adenovirus or other adenovirus 
that binds to HSP to reduce or eliminate binding to HSPs can be effected by 
replacing all or a portion that includes the shaft or at least the HSP consensus 
binding sequence of the Ad5 fiber with an adenovirus fiber or portion thereof 
that does not bind to HSP. Adenoviruses having fiber shafts that do not interact 

30 with HSP include (a) adenoviruses of subgroup B, e.g., Ad3, Ad35, Ad7, Ad1 1, 
Ad16, Ad21, Ad34 which do not have interaction with HSP, (b) adenoviruses of 
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subgroup F, e.g., Ad40 and Ad41, specifically the short fiber, and (c) 
adenoviruses of subgroup D, e.g., Ad46. 

As used herein, receptor refers to a biologically active molecule that 
specifically or selectively binds to (or with) other molecules. The term "receptor 
5 protein" can be used to more specifically indicate the proteinaceous nature of a 
specific receptor. 

As used herein, the term "cyclic RGD" (or cRGD) refers to any amino acid 
that binds to a v integrins on the surface of cells and contains the sequence RGD 
(Arg-Gly-Asp). 

10 As used herein, the term "heterologous polynucleotide" means a 

polynucleotide derived from a biological source other than an adenovirus or from 
an adenovirus of a different strain or can be a polynucleotide that is in a different 
locus from wild-type virus. The heterologous polynucleotide can encode a 
polypeptide, such as a toxin or a therapeutic protein. The heterologous 

15 polynucleotide can contain regulatory regions, such as a promoter regions, such 
as a promoter active in specific cells or tissue, for example, tumor tissue as 
found in oncolytic adenoviruses. Alternatively, the heterologous polynucleotide 
can encode a polypeptide and further contain a promoter region operably linked 
to the coding region. 

20 As used herein, reference to an amino acid in an adenovirus protein or to 

a nucleotide in an adenovirus genome is with reference to Ad5, unless specified 
otherwise. Corresponding amino acids and nucleotides in other adenovirus 
strains and modified strains and in vectors can be identified by those of skill in 
the art. Thus recitation of a mutation is intended to encompass all adenovirus 

25 strains that process a corresponding locus. 

As used herein, the KO mutations refer to mutations in fiber that knock 
out binding to CAR. For example, a KOI mutation refers to a mutation in the 
Ad5 fiber and corresponding mutations in other fiber proteins. In Ad5, this 
mutation results in a substitution of fiber amino acids 408 and 409. changing 

30 them from serine and proline to glutamic acid and alanine, respectively. As used 
herein, a K012 mutation refers to a mutation in the Ad5 fiber and corresponding 
mutations in other fiber proteins. In Ad5, this mutation is a four amino acid 
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substitution as follows: R512S, A515G, E516G, and K517G. Other KO 
mutations can be identified empirically or are known to those of skill in the art. 

As used herein, PD mutations refer to mutations in the penton gene that 
ablate binding by the encoded to a v integrin by replacing the RGD tripeptide. 
5 The PD1 mutation exemplified herein results in a substitution of amino acids 337 
through 344 of the Ad5 penton protein, HAIRGDTF (SEQ ID No. 9), with amino 
acids SRGYPYDVPDYAGTS (SEQ ID No. 10), thereby replacing the RGD 
tripeptide. 

As used herein, treatment means any manner in which the symptoms of a 

10 condition, disorder or disease are ameliorated or otherwise beneficially altered. 

As used herein, a therapeutically effective product is a product that is 
encoded by heterologous DNA that, upon introduction of the DNA into a host, a 
product is expressed that effectively ameliorates or eliminates the symptoms, 
manifestations of an inherited or acquired disease or that cures said disease. 

15 As used herein, a subject is an animal, such as a mammal, typically a 

human, including patients. 

As used herein, genetic therapy involves the transfer of heterologous 
DNA to the certain cells, target cells, of a mammal, particularly a human, with a 
disorder or conditions for which such therapy is sought. The DNA is introduced 

20 into the selected target cells in a manner such that the heterologous DNA is 
expressed and a therapeutic product encoded thereby is produced. 
Alternatively, the heterologous DNA may in some manner mediate expression of 
DNA that encodes the therapeutic product, it may encode a product, such as a 
peptide or RNA that in some manner mediates, directly or indirectly, expression 

25 of a therapeutic product. Genetic therapy may also be used to deliver nucleic 
acid encoding a gene product to replace a defective gene or supplement a gene 
product produced by the mammal or the cell in which it is introduced. The 
introduced nucleic acid may encode a therapeutic compound, such as a growth 
factor inhibitor thereof, or a tumor necrosis factor or inhibitor thereof, such as a 

30 receptor therefor, that is not normally produced in the mammalian host or that is 
not produced in therapeutically effective amounts or at a therapeutically useful 
time. The heterologous DNA encoding the therapeutic product may be modified 
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prior to introduction into the cells of the afflicted host in order to enhance or 
otherwise alter the product or expression thereof. 

As used herein, a therapeutic nuucleic acid is a nucleic acid that endes a 
therapeutic product. The product can be nucleic acid, such as a regulatory 
5 sequence or gene, or can encode a protein that has a therapeutic activity or 
effect. For example, therapeutic nucleic acid can be a ribozyme, antisense, 
double-stranded RNA, a nucleic acid encoding a protein and others. 

As used herein, "homologous" means about greater than 25% nucleic 
acid sequence identity, such as 25% 40%, 60%, 70%, 80%, 90% or 95%. If 

10 necessary the percentage homology will be specified. The terms "homology" 

and "identity" are often used interchangeably. In general, sequences are aligned 
so that the highest order match is obtained (see, e.g.: Computational Molecular 
Biology, Lesk, A.M., ed., Oxford University Press, New York, 1988; 
Biocomputing: Informatics and Genome Projects. Smith, D.W., ed., Academic 

15 Press, New York, 1993; Computer Analysis of Sequence Data, Part I, Griffin, 
A.M., and Griffin, H.G., eds., Humana Press, New Jersey, 1994; Sequence 
Analysis in Molecular Biology, von Heinje, G., Academic Press, 1 987; and 
Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton 
Press, New York, 1991 ; Carillo et al. (1988) S/AM J Applied Math 45:1073). 

20 By sequence identity, the number of conserved amino acids are determined by 
standard alignment algorithms programs, and are used with default gap penalties 
established by each supplier. Substantially homologous nucleic acid molecules 
would hybridize typically at moderate stringency or at high stringency all along 
the length of the nucleic acid or along at least about 70%, 80% or 90% of the 

25 full-length nucleic acid molecule of interest. Also contemplated are nucleic acid 
molecules that contain degenerate codons in place of codons in the hybridizing 
nucleic acid molecule. 

Whether any two nucleic acid molecules have nucleotide sequences that 
are at least, for example, 80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% 

30 "identical" can be determined using known computer algorithms such as the 
"FAST A" program, using for example, the default parameters as in Pearson et 
al. (1988) Proc. Natl. Acad. Sci. USA 55:2444 (other programs include the GCG 
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program package (Devereux, J., et aL, Nucleic Acids Research 12(i):387 
(1984)), BLASTP, BLASTN, FASTA (Atschul, S.F., et aL, J Molec Biol 2 75:403 
(1990); Guide to Huge Computers, Martin J. Bishop, ed., Academic Press, San 
Diego, 1994, and Carillo et aL (1988) SIAM J Applied Math 48:1073). For 
5 example, the BLAST function of the National Center for Biotechnology 

Information database can be used to determine identity. Other commercially or 
publicly available programs include, DNAStar "Meg Align" program (Madison, Wl) 
and the University of Wisconsin Genetics Computer Group (UWG) "Gap" 
program (Madison Wl)). Percent homology or identity of proteins and/or nucleic 

10 acid molecules can be determined, for example, by comparing sequence 

information using a GAP computer program (e.g., Needleman et aL (1970) J. 
MoL BioL 48:443, as revised by Smith and Waterman ((1981) Adv. AppL Math. 
2:482). Briefly, the GAP program defines similarity as the number of aligned 
symbols (i.e., nucleotides or amino acids) which are similar, divided by the total 

15 number of symbols in the shorter of the two sequences. Default parameters for 
the GAP program can include: (Da unary comparison matrix (containing a value 
of 1 for identities and 0 for non-identities) and the weighted comparison matrix 
of Gribskov et aL (1986) NucL Acids Res. 14:6745, as described by Schwartz 
and Dayhoff, eds., ATLAS OF PROTEIN SEQUENCE AND STRUCTURE, National 

20 Biomedical Research Foundation, pp. 353-358 (1979); (2) a penalty of 3.0 for 
each gap and an additional 0.10 penalty for each symbol in each gap; and (3) no 
penalty for end gaps. Therefore, as used herein, the term "identity" represents a 
comparison between a test and a reference polypeptide or polynucleotide. 

As used herein, the term "at least 90% identical to" refers to percent 

25 identities from 90 to 99.99 relative to the reference polypeptides. Identity at a 
level of 90% or more is indicative of the fact that, assuming for exemplification 
purposes a test and reference polynucleotide length of 1 00 amino acids are 
compared, no more than 10% (i.e., 10 out of 100) of amino acids in the test 
polypeptide differs from that of the reference polypeptides. Similar comparisons 

30 can be made between a test and reference polynucleotides. Such differences 
can be represented as point mutations randomly distributed over the entire 
length of an amino acid sequence or they can be clustered in one or more 
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locations of varying length up to the maximum allowable, e.g. 1 0/1 00 amino 
acid difference (approximately 90% identity). Differences are defined as nucleic 
acid or amino acid substitutions, or deletions. At the level of homologies or 
identities above about 85-90%, the result should be independent of the program 
5 and gap parameters set; such high levels of identity can be assessed readily, 
often without relying on software. 

As used herein: stringency of hybridization in determining percentage 
mismatch is as follows: 

1) high stringency: 0.1 x SSPE, 0.1 % SDS, 65 °C 

10 2) medium stringency: 0.2 x SSPE, 0.1% SDS, 50°C 

3) low stringency: 1 .0 x SSPE, 0.1 % SDS, 50°C 
Those of skill in this art know that the washing step selects for stable 
hybrids and also know the ingredients of SSPE (see, e.g., Sambrook, E.F. 
Fritsch, T. Maniatis, in: Molecular Cloning, A Laboratory Manual. Cold Spring 

15 Harbor Laboratory Press (1989), vol. 3, p. B.13, see, also, numerous catalogs 
that describe commonly used laboratory solutions). SSPE is pH 7.4 phosphate- 
buffered 0.18 M NaCI. Further, those of skill in the art recognize that the 
stability of hybrids is determined by T m , which is a function of the sodium ion 
concentration and temperature (T m = 81 .5° C-1 6.6(log 10 [Na + ]) + 0.41 (%G + C)- 

20 600/D), so that the only parameters in the wash conditions critical to hybrid 
stability are sodium ion concentration in the SSPE (or SSC) and temperature. 

It is understood that equivalent stringencies can be achieved using 
alternative buffers, salts and temperatures. By way of example and not 
limitation, procedures using conditions of low stringency are as follows (see also 

25 Shilo and Weinberg, Proc. Natl. Acad. Sci. USA 75:6789-6792 (1981)): Filters 
containing DNA are pretreated for 6 hours at 40°C in a solution containing 35% 
formamide, 5X SSC, 50 mM Tris-HCI (pH 7.5), 5 mM EDTA, 0.1 % PVP, 0.1% 
Ficoll, 1% BSA, and 500 //g/ml denatured salmon sperm DNA (10X SSC is 1.5 
M sodium chloride, and 0.15 M sodium citrate, adjusted to a pH of 7). 

30 Hybridizations are carried out in the same solution with the following 

modifications: 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 //g/ml salmon sperm 
DNA, 10% (wt/vol) dextran sulfate, and 5-20 X 10 6 cpm 32 P-labeled probe is 
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used. Filters are incubated in hybridization mixture for 18-20 hours at 40°C, 
and then washed for 1 .5 hours at 55 °C in a solution containing 2X SSC, 25 mM 
Tris-HCI (pH 7.4), 5 mM EDTA, and 0.1% SDS. The wash solution is replaced 
with fresh solution and incubated an additional 1.5 hours at 60°C. Filters are 
5 blotted dry and exposed for autoradiography. If necessary, filters are washed for 
a third time at 65-68 °C and reexposed to film. Other conditions of low 
stringency which can be used are well known in the art {e.g., as employed for 
cross-species hybridizations). 

By way of example and not way of limitation, procedures using 

10 conditions of moderate stringency include, for example, but are not limited to, 
procedures using such conditions of moderate stringency are as follows: Filters 
containing DNA are pretreated for 6 hours at 55 °C in a solution containing 6X 
SSC, 5X Denhart's solution, 0.5% SDS and 100//g/ml denatured salmon sperm 
DNA. Hybridizations are carried out in the same solution and 5-20 X 10 6 cpm 

15 32 P-labeled probe is used. Filters are incubated in hybridization mixture for 18-20 
hours at 55 °C, and then washed twice for 30 minutes at 60 °C in a solution 
containing 1X SSC and 0.1 % SDS. Filters are blotted dry and exposed for 
autoradiography. Other conditions of moderate stringency which can be used 
are well-known in the art. Washing of filters is done at 37 °C for 1 hour in a 

20 solution containing 2X SSC, 0.1% SDS. 

By way of example and not way of limitation, procedures using conditions 
of high stringency are as follows: Prehybridization of filters containing DNA is 
carried out for 8 hours to overnight at 65 °C in buffer composed of 6X SSC, 
50 mM Tris-HCI (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, 

25 and 500 //g/ml denatured salmon sperm DNA. Filters are hybridized for 48 hours 
at 65 °C in prehybridization mixture containing 100 //g/ml denatured salmon 
sperm DNA and 5-20 X 10 6 cpm of 32 P-labeled probe. Washing of filters is done 
at 37°C for 1 hour in a solution containing 2X SSC, 0.01 % PVP, 0.01 % Ficoll, 
and 0.01% BSA. This is followed by a wash in 0.1 X SSC at 50°C for 45 

30 minutes before autoradiography. Other conditions of high stringency which can 
be used are well known in the art. 
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The term substantially identical or substantially homologous or similar 
varies with the context as understood by those skilled in the relevant art and 
generally means at least 60% or 70%, preferably means at least 80%, 85% or 
more preferably at least 90%, and most preferably at least 95% identity. 
5 As used herein, substantially identical to a product means sufficiently 

similar so that the property of interest is sufficiently unchanged so that the 
substantially identical product can be used in place of the product. 

As used herein, substantially pure means sufficiently homogeneous to 
appear free of readily detectable impurities as determined by standard methods 

10 of analysis, such as thin layer chromatography (TLC), gel electrophoresis and 

high performance liquid chromatography (HPLC), used by those of skill in the art 
to assess such purity, or sufficiently pure such that further purification would 
not detectably alter the physical and chemical properties, such as enzymatic and 
biological activities, of the substance. Methods for purification of the 

15 compounds to produce substantially chemically pure compounds are known to 
those of skill in the art. A substantially chemically pure compound can, 
however, be a mixture of stereoisomers or isomers. In such instances, further 
purification might increase the specific activity of the compound. 

The methods and and preparation of products provided herein, unless 

20 otherwise indicated, employ conventional techniques of chemistry, molecular 

biology, microbiology, recombinant DNA, genetics, immunology, cell biology, cell 
culture and transgenic biology, which are within the skill of the art (see, e.g., 
Maniatis et al. (1 982) Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY); Sambrook et al. (1989) 

25 Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY; Ausubel et al. (1992) Current Protocols in Molecular 
Biology, Wiley and Sons, New York; Glover (1985) DNA Cloning I and II, Oxford 
Press; Anand (1992) Techniques for the Analysis of Complex Genomes 
(Academic Press); Guthrie and Fink (1991) Guide to Yeast Genetics and 

30 Molecular Biology, Academic Press; Harlow and Lane (1988) Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harobor, NY; 
Jakoby and Pastan, eds. (1979) Cell Culture. Methods in Enzymology 58, 
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Academic Press, Inc., Harcourt Brace Jaovanovich, NY; Nucleic Acid 
Hybridization (B. D. Hames & S. J. Higgins eds. 1984); Culture Of Animal Cells 
(R. I. Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL 
Press, 1986); B. Perbal (1984), A Practical Guide To Molecular Cloning; Gene 
5 Transfer Vectors For Mammalian Cells (J. H. Miller and M. P. Calos eds., 1 987, 
Cold Spring Harbor Laboratory); Methods In Enzymology, Vols. 154 and 155 
(Wu et al. eds.); Immunochemical Methods In Cell And Molecular Biology (Mayer 
and Walker, eds., Academic Press, London, 1987); Handbook Of Experimental 
Immunology, Volumes l-IV (D. M. Weir and C. C. Blackwell, eds., 1986); Hogan 

10 et aL (1986) Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, N.Y. 
B. Capsid modifications 

Provided herein are modifications of the viral capsid that ablate the 
interaction of an adenovirus with its natural receptors. In particular, fiber 

15 modifications that result in ablation of the interaction of an adenvirus with HSP 
are provided. These fiber modifications can be combined with other capsid 
protein modifications, such as other fiber modifications and/or penton and/or 
hexon modifications, to fully ablate viral interactions with natural receptors, 
when expressed on a viral particle. The modification should not disrupt trimer 

20 formation or transport of fiber into the nucleus. 
1 . Fiber genes and proteins 

The fiber protein extends from the capsid and mediates viral binding to 
the cell surface by binding to specific cell receptors (Philipson et aL (1 968) J. 
ViroL 2:1064-1075). The fiber is a trimeric protein that includes an N-terminal 

25 tail domain that interacts with the adenovirus penton base, a central shaft 

domain of varying length, and a C-terminal knob domain that contains the cell 
receptor binding site (Chroboczek et aL (1995) Curr. Top. Microbiol. Immunol. 
755:163 _200; Riurok et aL (1990) J. Mo L Biol. 275:589-596; Stevenson et aL 
(1995) J. ViroL 55:2850-2857). The sequences of the fiber gene from a variety 

30 of serotypes including adenovirus serotypes 2 (Ad2), Ad5, Ad3, Ad35, Ad12, 
Ad40, and Ad41 are known. There are at least 21 different fiber genes in 
Genbank. 



WO 03/062400 PCT/US03/02295 



-28- 

As noted, the fiber protein can be divided into three domains (see, e.g., 
Green et al. (1983) EMBO J. 2:1357-1365). The conserved N-terminus contains 
the sequences responsible for association with the penton base as well as a 
nuclear localization signal. A rod-like shaft of variable length contains repeats of 
5 a 1 5 amino acid beta structure, with the number of repeats ranging from 6 in 
Ad3 to 22 in Ad5. A conserved stretch of amino acids which includes the 
sequence TLWT (SEQ ID No. 36) marks the boundary between the repeating 
units of beta structure in the shaft and the globular head domain. The C-terminal 
head domain ranges in size from 1 57 amino acid residues for the short fiber of 

10 Ad41 to 193 residues for Adl 1 and Ad34. The fiber spike is a homotrimer and 
it is thought that the C-terminus is responsible for trimerization of the fiber 
homotrimer and there are 1 2 spikes per virion which are attached via association 
with the penton base complex. 

2. Modification of HSP interaction 

1 5 The adenovirus fiber protein is a major determinant of adenovirus tropism 

(Gall et al. (1 996) J. Virol. 70:2116-2123; Stevenson et al. (1 995) J. Virol. 
65:2850-2857). Dogma in the field has been that adenoviral entry occurs via 
binding to CAR and integrins. This is underscored by published data (Einfeld et 
al. (2001 ) J. Virology 75:1 1 284-1 1 291 ). it is shown herein, however, these 

20 published entry pathways are not the predominant ones that act in vivo. 

Moreover, as shown herein, the dominant entry pathway for hepatocytes in vivo 
involves a mechanism mediated by the fiber shaft, such as Ad5 shaft, through 
heparin sulfate proteoglycans binding. 

It is shown herein that elimination of this binding eliminates entry vis HSP 

25 binding, such as in hepatocytes. Adenoviral fiber shaft modifications that ablate 
viral interaction with HSP are provided. Thus, as provided herein, efficient 
detargeting of adenovirus in vivo can be achieved with appropriately designed 
fiber proteins. Suitable modifications, such as described herein, can be made 
with respect to any adenovirus in which the wild-type interacts with HSP. 

30 As provided herein, the ability of an adenoviral vector to interact with 

HSP is modified. In particular, the ability to interact is reduced or eliminated. 
Modifications include insertions, deletions, individual amino acid mutations and 
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other mutations that alter the structure of the fiber shaft such that the HSP 
binding of the modified fiber protein is ablated when compared to the HSP 
binding of the wild-type fiber protein. 

In a first aspect of this embodiment, an adenoviral fiber protein is 
5 modified by mutating one or more of the amino acids that interact with HSP. 

For example, the HSP binding motif of the modified fiber protein is no longer able 
to interact with HSP on the cell surface, thus ablating the viral interaction with 
HSP. For example, the adenoviral fiber is from a subgroup C adenovirus. 
Binding to HSP can be eliminated or reduced by mutating the fiber shaft in order 

10 to modify the ability of the HSP binding motif, which is, for example, KKTK 

sequence (SEQ ID No. 1) located between amino acid residues 91 to 94 in the 
Ad 5 fiber, to interact with HSP. The fiber proteins are modified by chemical 
and biological techniques known to those skilled in the art, such as site directed 
mutagenesis of nucleic aicd encoding the fiber or other techniques as illustrated 

15 herein. 

In another aspect of this embodiment, the ability of a fiber to interact 
with HSP is modified by replacing the wild-type fiber shaft with a fiber shaft, or 
portion thereof, of an adenovirus that does not interact with HSP to produce 
chimeric fiber proteins. The portion is sufficient to reduce or eliminate 

20 interaction with HSP. Examples of adenoviruses having fiber shafts that do not 
interact with HSP include (a) adenoviruses of subgroup B, such as, but are not 
limited to, Ad3, Ad35, Ad7, Ad1 1, Ad16, Ad21, Ad34, which do not have 
interaction with HSP, (b) adenoviruses of subgroup F, such as, but are not 
limited to, Ad40 and Ad41, specifically the short fiber, and (c) adenoviruses of 

25 subgroup D, such as but are not limited to, Ad46. In another embodiment, 
adenoviral fiber shaft modifications that ablate viral interaction with HSP in 
combination with adenoviral fiber knob modifications that ablate viral interactions 
with CAR are provided. Suitable adenoviral fiber modifications include the 
fiber knob modifications are known to those of skill in the art and are exemplified 

30 herein (see, also, US. Patent Application Serial No. 09/870,203, filed on 30 May 
2001, and published as U.S. Published application No. 20020137213, in 
International Patent Application No. PCT/EP0 1/06286 filed on 1 June 2001). 
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Modifications of the fiber include mutations of at least one amino acid in the CD 
loop of a wild-type fiber protein of an adenovirus from subgroup C, D, or E, or 
the long wild-type fiber of an adenovirus from subgroup F, whereby the ability of 
a fiber protein to bind to CAR is reduced or substantially eliminated. The fiber 
5 proteins with ablated CAR interaction are modified by chemical and biological 
techniques known to those skilled in the art, as illustrated herein and as 
described in the above patent application. 

Alternatively, adenoviral fiber modifications are made by replacing the 
wild-type fiber knob with a fiber knob of an adenovirus that does not interact 

10 with CAR. The fiber protein also will be selected so that it does not interact 
with HSP. Examples of adenoviruses having fiber knobs that do not interact 
with CAR include (a) adenoviruses of subgroup B, e.g., Ad3, Ad35, Ad7, Ad1 1, 
Ad16, Ad21, Ad34, (b) adenoviruses of subgroup F, e.g., Ad40 and Ad41, 
specifically the short fiber. 

1 5 in another embodiment, adenoviral fiber shaft modifications that ablate 

viral interaction with HSP in combination with penton modifications that ablate 
viral interactions with a v integrins are provided. Suitable adenoviral penton 
modifications include the penton modifications, which are well known to those^ 
of skill in the art (see, e.g., U.S. Patent No. 5,731,190; see, also Einfeld eta/. 

20 (2001 ) J. Virology 75-A 1 284-1 1 291 ; and Bai et a/. (1 993) J. Virology 
67:5198-5205). 

For example, penton interaction with cr v integrins can be ablated (reduced 
or eliminated) by substitution of the RGD tripeptide motif, required for cr v 
interaction, in penton with a different tripeptide that does not interact with an a v 
25 integrin. The penton proteins with ablated a v integrin interactions are modified 
by chemical and biological techniques known to those skilled in the art (see, 
e.g., described U.S. Patent No. 6,731,190 and as illustrated herein). Generally, 
the adenovirus is a subgroup B or C adenovirus. 

Also provided are adenoviral fiber shaft modifications that ablate viral 
30 interaction with HSP in combination with adenoviral fiber knob modifications that 
ablate viral interactions with CAR and with penton modifications that ablate viral 
interactions with a v integrins. These modifications are described above and 
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prepared using chemical and biological techniques known to those skilled in the 
art and as illustrated herein. Generally the adenovirus is a subgroup B or 
subgroup C adenovirus. 

Preparation of fibers modified to eliminate or reduce HSP interactions and 
5 fibers modified to alter interactions with other receptors and cell surface 
proteins, such as CAR and/or a v integrin, is also described in the Examples 
below. The nucleic acid and/or amino acid sequences of exemplary modified 
fibers, whose construction are described below) are set forth as SEQ ID Nos. 45- 
72 as follows: 

10 SEQ ID Nos. 45 and 46 set forth the encoding nucleotide sequence and 

amino acid sequence of the modified fiber designated 5FK01, where 5F refers to 
Adenovirus 5 fiber, KOI is an exemplary mutation of the CAR interaction site 
described herein; 

SEQ ID Nos. 47 and 48 set forth the encoding nucleotide sequence and 
15 amino acid sequence of the modified ber designated 5FK01RGD, which further 
includes an RGD ligand to demonstrate retargeting; 

SEQ ID Nos. 49 and 50 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5FK012, where 5F refers 
to Adenovirus 5 fiber, K012 is another exemplary mutation of the CAR 
20 interaction site described herein; 

SEQ ID Nos. 51 and 52 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5F S* nuc, where 5F 
refers to Adenovirus 5 fiber, S* is an exemplary mutation of the shaft that alters 
binding to HSP; 

25 SEQ ID Nos. 53 and 54 set forth the encoding nucleotide sequence and 

amino acid sequence of the modified fiber designated 5F S*RGD nuc, which 
further includes an RGD ligand; 

SEQ ID Nos. 55 and 56 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified ber designated 5FK01S*, which contain 

30 the KQ1 and S* mutations; 
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SEQ ID Nos. 57 and 58 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5FK01S*RGD / which 
further includes an RGD ligand; 

SEQ ID Nos. 59 and 60 set forth the encoding nucleotide sequence and 
5 amino acid sequence of a Ad35 fiber; 

SEQ ID Nos. 61 and 62 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 35FRGD, which is 35F 
fiber with an RGD ligand; 

SEQ ID Nos. 63 and 64 set forth the encoding nucleotide sequence and 
10 amino acid sequence of a Ad41 short fiber; 

SEQ ID Nos. 65 and 66 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 41sFRGD, which is 41 F 
short fiber with an RGD ligand; 

SEQ ID Nos. 67 and 68 set forth the encoding nucleotide sequence and 
15 amino acid sequence of Ad5 penton; 

SEQ ID Nos. 69 and 70 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 5TS35H, which is a 
chimeric fiber in which an Ad5 fiber tail and shaft regions (5TS; amino acids 1 to 
403) are connected to an Ad35 fiber head region (35H; amino acids 137 to 323) 
20 to form the 5TS35H chimera; and 

SEQ ID Nos. 71 and 72 set forth the encoding nucleotide sequence and 
amino acid sequence of the modified fiber designated 35TS5H, which is a 
chimeric fiber in which an Ad35 fiber tail and shaft regions (35TS; amino acids 1 
to 136) are connected to an Ad5 fiber head region (5H; amino acids 404 to 581) 
25 to form the 35TS5H chimera. 

SEQ ID No. 1 sets forth the nucleotide sequence of Ad fiber; SEQ ID Nos. 
2 and 3 also set forth the coding nucleic acid sequencs for fibers with modified 
fiber knobs for ablated CAR interaction (see, SEQ ID No. 2 for KOI and SEQ ID 
No. 3 for K012); SEQ ID No. 4 also sets for the encoding nucleic acid sequence 
30 of a modified penton for ablated a v integrins (SEQ ID No. 4). 

The modified fibers are displayed on virus particles by modifying the fiber 
protein and optionally additional proteins. This can be achieved by preparing 
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adenoviral vectors that express the modified capsid proteins and produce 

particles with modified fibers, or by packaging adenoviral vectors, particularly 

those that do not encode one or more capsid proteins in appropriate packaging 

lines. Hence, as discussed in detail below, adenoviral vectors and viral particles 

5 with modified fibers that do no bind to HSP are provided. 

C. Nucleic acids. Adenoviral vectors and cells containing the nucleic acids 
and cells containing the vectors 

■ Also provided are polynucleotides that encode modified capsid proteins 

and that encode vectors for preparation of adenovirus that express modified 

10 capsid proteins provided herein. The sequences of the wild-type adenovirus 

proteins are well known in the art and are modified as described herein. Nucleic 
acid molecules, such as cDNA that encode an exemplary modified fiber knob for 
ablated CAR interaction (see, SEQ ID No. 2 for K01 and SEQ ID No. 3 for K012) 
and for a modified penton for ablated a v integrins (SEQ ID No. 4) are provided. 

15 As discussed above, modified capsid proteins with altered tropism for CAR and 
a v integrins are known and described in the patents, applications and literature 
cited herein and known to those of skill in the art (see, e.g., U.S. Patent No. 
5,731,190, U.S. application Serial No. 09/870,203, published as U.S. Published 
application No. 20020137213; and Bai eta/. (1993) J. Virology 

20 67:5198-5208). 

Also provided are vectors including the polynucleotides provided herein. 
Such vectors include partial or complete adenoviral genomes and plasmids. Such 
vectors are constructed by techniques known to those skilled in the art and as 
illustrated herein. Also provided are adenoviral vectors modified by replacing 

25 whole fiber protein, or portions thereof, with the fiber proteins, or appropriate 
portions thereof, of an adenovirus that does not interact with HSP. 
Adenoviruses that do not interact with HSP can be identified by using the 
methods described herein which detect binding or non-binding of fiber proteins 
and adenoviruses with HSP. Among the adenoviral vectors provided herein are 

30 those of subgroup C, which include Ad2 and Ad5, in which the nucleic acid 
encoding the fiber shaft or a portion including the HSP-binding portion is 
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replaced with nucleic acid encoding fiber or an appropriate portion thereof from a 
serotype, such as Ad35. 

Adenoviral fiber modifications, thus, can be made in viral particles by 
replacing the entire fiber protein with the fiber protein of an adenovirus that does 
5 not interact with CAR and/or replacing the HSP binding portion with a portion 
that does not bind. Generally the adenovirus is a subgroup B or subgroup C 
adenovirus, and also an adenovirus of subgroup D, such as Ad46. Adenoviral 
vectors of subgroup C, such as Ad2 and Ad5, having a replaced fiber knob are 
prepared using techniques well known in the art and as illustrated herein. 

10 1 - Preparation of viral particles 

The packaging cells used to produce the viruses provided herein contain 
the nucleic acid encoding the capsid protein, including the mutated fiber protein 
provided herein. Such nucleic acid can be transfected into the cell, generally 
part of as part of plasmid, or it can be infected into the cell with a viral vector. It 

15 can be stably incorporated into the genome of the cell, thus providing for a 

stable cell line. Alternatively, nucleic acid encoding the mutated capsid protein 
can be removed from the genome, in which case a transient complementing cell 
is employed. 

The adenovirus genome to be packaged is transferred into the 
20 complementing cell by techniques known to those skilled in the art. These 

techniques include transfection or infection with the adenovirus. The nucleic acid 
encoding the mutated fiber protein can be in this genome instead of in the 
packaging cell. 

In certain cases, when the nucleic acid encoding the mutated fiber is in 
25 the genome to be packaged, it can be desirable for the packaging cell to also 
encode a fiber protein. Such protein can assist in the maturation and packaging 
of an infectious particle. Such protein can be a wild-type fiber protein or one 
modified such that it is unable to attach to the penton base protein and is for 
use, for example, in producer cells where the fiber is included to provide the 
30 packaging function and the vector encodes a full-length fiber. 

The packaging cells are cultured under conditions that permit the 
production of the desired viral particle. The viral particles are recovered by 
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standard techniques. An exemplary method for producing adenoviral particles 
provided herein is as follows. The nucleic acid encoding the mutated fiber 
protein is made using standard techniques in an adenoviral shuttle plasmid. This 
plasmid contains the right end of the virus, in particular from the end of the E3 
5 region through the right ITR. This plasmid is co-transfected into competent cells 
of an E. coli strain, such as the well known E. coli strain BJ5183 (see, e.g., 
Degryse (1996) Gene 7 70:45-50) along with a plasmid, which contains the 
remaining portion of the adenovirus genome, except for the E1 region and 
sometimes also the E2a region and also contains a corresponding region of 

10 homology. Homologous recombination between the two plasmids generates a 
full-length plasmid encoding the entire adenoviral vector genome. 

This full-length adenoviral vector genome plasmid is then transfected into 
a complementing cell line. The transfection can be performed in the presence of 
a reagent that directs adenoviral particle entry into producer cells. Such 

15 reagents include, but are not limited to, polycations and bifunctional reagents, 
such as those described herein. A complementing cell is, for example, is a cell 
of the PER.C6 cell line, which contains the adenoviral E1 gene (PER.C6 is 
available, for example, from Crucell, The Netherlands; deposited under ECACC 
accession no. 96022940; see, also Fallaux etal. (1998) Hum. Gene Ther. 

20 3:1909-1907; see, also, U.S. Patent No. 5,994,128) or an AE1-2a cell (see, 
Gorziglia etal. (1996) J. Virology 70:4173-4178; and and Von Seggern etal. 
(1 998) J. Gen. Virol. 73:1461-1468)). 

AE1-2a cells are derivatives of the A549 lung carcinoma line (ATCC # 
CCL 185) with chromosomal insertions of the plasmids pGRE5-2.E1 (also 

25 referred to as GRE5-E1-SV40-Hygro construct and listed in SEQ ID No. 41) and 
pMNeoE2a-3.1 (also referred to as MMTV-E2a-SV40-Neo construct and listed in 
SEQ ID No. 42), which provide complementation of the adenoviral E1 and E2a 
functions, respectively. 

The 633 cell line (see, von Seggern et al. (2000) J. Virology 

30 74:354-362), which stably expresses the adenovirus serotype 5 wild-type fiber 
protein, and was derived from the AE1-2a cell line, is another an example of 
complementing cells. When the cell line is 633 cells, the final passage of 
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adenoviral vector is performed on another complementing cell line {e.g., Per.C6), 
which does not express wild-type Ad5 fiber. 

Jhe transfected complementing cells are maintained under standard cell 
culture conditions. The adenoviral plasmids recombine to form the adenoviral 
5 genome that is packaged. The particles are infectious, but replication deficient 
because their genome is missing at least the E1 genes. When performed in the 
633 cells the particles contain wild-type and mutated fiber proteins. They are 
recovered from the crude viral lysate, amplified, and are purified by standard 
techniques. 

10 The recovered particles can be used to infect PER.C6 or AE1-2a cells. 

This permits the recovery of particles whose capsids contain only the desired 
mutated fiber. This two-step procedure provides high titer batches of the 
adenoviral particles provided herein. The adenoviral particles can be replication 
competent or replication incompetent. 

15 In one embodiment, the particles selectively replicate in certain 

predetermined target tissue but are replication incompetent in other cells and 
tissues. In a particular embodiment, the adenoviral particles replicate in 
abnormally proliferating tissue, such as solid tumors and other neoplasms. In 
replication conditional adenoviruses, a gene essential for replication is placed 

20 under control of a heterologous promoter which is cell or tissue specific. For 

example, the El a gene is placed under control of a promoter which is active in a 
tumor cell to produce an oncolytic adenovirus or oncolytic adenoviral vector. 
Administration of oncolytic adenoviral vectors to tumor cells kills the tumor cells. 
Such replication conditional adenoviral particles and vectors can be produced by 

25 techniques known to those skilled in the art, such as those disclosed in the 

above-referenced U.S. Patent Nos. 5,998,205 and 5,801,029. These particles 
and vectors can be produced in adenoviral packaging cells as disclosed above. 
Generally packaging cells are those that have been designed to limit homologous 
recombination that could lead to wild-type adenoviral particles. Such cells are 

30 well known and include the packaging cell known as PER.C6 (see, e.g., U.S. 

Patent Nos. 5,994,128 and 6,033,908; deposited under ECACC accession no. 
96022940). Since oncolytic vectors are replication competent in certain cell 
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types, they can be amplified in cell lines derived from said cell type without 
provision of Ad complementary genes. 

2. Adenoviral vectors and particles 

The adenovirus as used herein for production of the adenoviral vectors 
5 and particles can be of any serotype. Adenoviral stocks that can be employed 
as a source of adenovirus or adenoviral coat protein, such as fiber and/or penton 
base, can be amplified from the adenoviral serotypes 1 through 47, which are 
currently available from the American Type Culture Collection (ATCC, Rockville, 
Md.), or from any other serotype of adenovirus available from any other source. 

10 For instance, an adenovirus can be of subgroup A (e.g., serotypes 12, 18, 31), 
subgroup B (e.g., serotypes 3, 7, 1 1 , 14, 1 6, 21 , 34, 35), subgroup C (e.g., 
serotypes 1, 2, 5, 6), subgroup D (e.g., serotypes 8, 9, 10, 13, 15, 17, 19, 20, 
22-30, 32, 33, 36-39, 42-47), subgroup E (serotype 4), subgroup F (serotype 
40, 41), or any other adenoviral serotype. 

15 in certain embodiments, the adenovirus is a subgroup Bora subgroup C 

adenovirus. Subgroup C adenoviruses which are modified in as described herein, 
include, but are not limited to, Ad2 and Ad5. For Ad5, the mutation is made in 
the KKTK sequence (SEQ ID No. 1) located between amino acid residues 91 to 
94. The fiber proteins can be modified by chemical and biological techniques 

20 known to those skilled in the art. These methods include, but are not limited to, 
site directed mutagenesis and techniques as illustrated herein. 

The adenoviral particle generally includes a targeting ligand as described 
above. The presence of the targeting ligand permits the delivery of a gene to a 
desired cell type which is different from the cell type that wild-type adenovirus 

25 particles infect or the same as that a wild-type particle infects, but allowing the 
infection in a selective manner, i.e., non-target cell types are not significantly 
infected. 

The adenoviral vectors provided herein can be used to study cell 
transduction and gene expression in vitro or in various animal models. The latter 
30 case includes ex vivo techniques, in which cells are transduced in vitro and then 
administered to the animal. They also can be used to conduct gene therapy on 
humans or other animals. Such gene therapy can be ex vivo or in vivo. For in 
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vivo gene therapy, the adenoviral particles in a pharmaceutically-acceptable 
carrier are delivered to a human in a therapeutically effective amount in order to 
prevent, treat, or ameliorate a disease or other medical condition in the human 
through the introduction of a heterologous gene that encodes a therapeutic 
5 protein into cells in such human. The adenoviruses are delivered at a dose 
ranging from approximately 1 particle per kilogram of body weight to 
approximately 10 14 particles per kilogram of body weight. Generally, they are 
delivered at a dose of approximately 10 6 particles per kilogram of body weight to 
approximately 10 13 particles per kilogram of body weight, and typically the dose 
10 ranges from approximately 10 8 particles per kilogram of body weight to 
approximately 10 12 particles per kilogram of body weight. 

Any vectors known to those of skill in the art can be employed and used 
to produce viral particles that include fibers modified to ablate (including reduce) 
binding to HSP. Some exemplary vectors are as follows. 
15 a. Gutless vectors 

Gutted adenovirus vectors are those from which most or all viral genes 
have been deleted. They are grown by co-infection of the producing cells with a 
"helper" virus (such as using an El-deleted Ad vector), where the packaging cells 
expresses the E1 gene products. The helper virus trans- complements the 
20 missing Ad functions, including production of the viral structural proteins needed 
for particle assembly. To incorporate the capsid modifications into a gutted 
adenoviral vector capsid, the changes must be made to the helper virus as 
described herein. All the necessary Ad proteins including the modified capsid 
protein are provided by the modified helper virus, and the gutted adenovirus 
25 particles are equipped with the particular modified capsid expressed by the host 
cells. The E1a, Eb, E2a, E2b and E4 are generally required for viral replication 
and packaging. If these genes are deleted, then the packaging cell must provide 
these genes or functional equivalents. 

A helper adenovirus vector genome and a gutless adenoviral vector 
30 genome are delivered to packaging cells. The cells are maintained under 
standard cell maintenance or growth conditions, whereby the helper vector 
genome and the packaging cell together provide the complementing proteins for 
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the packaging of the adenoviral vector particle. Such gutless adenoviral vector 
particles are recovered by standard techniques. The helper vector genome can 
be delivered in the form of a plasmid or similar construct by standard 
transfection techniques, or it can be delivered through infection by a viral particle 
5 containing the genome. Such viral particle is commonly called a helper virus. 
Similarly, the gutless adenoviral vector genome can be delivered to the cell by 
transfection or viral infection. 

The helper virus genome can be the modified adenovirus vector genome 
as disclosed herein. Such genome also can be prepared or designed so that it 
10 lacks the genes encoding the adenovirus El A and E1 B proteins. In addition, the 
genome can further lack the adenovirus genes encoding the adenovirus E3 
proteins. Alternatively, the genes encoding such proteins can be present but 
mutated so that they do not encode functional E1A, E1B and E3 proteins. 
Furthermore, such vector genome can not encode other functional early proteins, 
15 such as E2A, E2B3, and E4 proteins. Alternatively, the genes encoding such 
other early proteins can be present but mutated so that they do not encode 
functional proteins. 

In producing the gutless vectors, the helper virus genome is also 
packaged, thereby producing helper virus. In order the minimize the amount of 
20 helper virus produced and maximize the amount of gutless vector particles 

produced, the packaging sequence in the helper virus genome, can be deleted or 
otherwise modified so that packaging of the helper virus genome is prevented or 
limited. Since the gutless vector genome will have an unmodified packaging 
sequence, it will be preferentially packaged. 
25 One way to do this is to mutate the packaging sequence by deleting one 

or more of the nucleotides comprising the sequence or otherwise mutating the 
sequence to inactivate or hamper the packaging function. One exemplary 
approach is to engineer the helper genome so that recombinase target sites flank 
the packaging sequence and to provide a recombinase in the packaging cell. The 
30 action of recombinase on such sites results in the removal of the packaging 

sequence from the helper virus genome. The recombinase can be provided by a 
nucleotide sequence in the packaging cell that encodes the recombinase. Such 
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sequence can be stably integrated into the genome of the packaging cell. 
Various kinds of recombinase are known by those skilled in the art, and include, 
but are not limited to, Cre recombinase, which operates on so-called lox sites, 
which are engineered on either side of the packaging sequence as discussed 
5 above (see, e.g., U.S. Patent Nos. 5,919, 676, 6,080,569 and 5,919,676; see, 
also, e.g., Morsy and Caskey, Molecular Medicine Today, Jan. 1999, pgs. 
18-24). 

An example of a gutless vector is pAdARSVDys (Haecker eta/. (1996) 
Hum Gene Ther. 7:1907-1914)). This plasmid contains a full-length human 

10 dystrophin cDNA driven by the RSV promoter and flanked by Ad inverted 
terminal repeats and packaging signals. 293 cells are infected with a 
first-generation Ad, which serves as a helper virus, and then transfected with 
purified pAdARSVDys DNA. The helper Ad genome and the pAdARSVDys DNA 
are replicated as Ad chromosomes, and packaged into particles using the viral 

15 proteins produced by the helper virus. Particles are isolated and the 

pAdARSVDys-containing particles separated from the helper by virtue of their 
smaller genome size and therefore different density on CsCI gradients. Other 
examples of gutless adenoviral vectors are known (see, e.g., Sandig eta/. 
(2000) Proc. Natl. Acad. Sci. U.S.A. 97(3)^002-7). 

20 b- Oncolytic vectors 

Briefly, oncolytic adenoviruses, which are viruses that replicate selectively 
in tumor cells, are designed to amplify the input virus dose due to viral 
replication in the tumor, leading to spread of the virus throughout the tumor 
mass. In situ replication of adenoviruses leads to cell lysis. This in situ replication 

25 permits relatively low, non-toxic doses to be highly effective in the selective 

elimination of tumor cells. One approach to achieving selectivity is to introduce 
loss-of-function mutations in viral genes that are essential for growth in 
non-target cells but not in tumor cells. (See, e.g., U.S. Patent No. 5,801,029.) 
This strategy is exemplified by the use of Addl1520, which has a deletion in the 

30 E1 b-55KD gene. In normal cells, the adenoviral E1b-55KD protein is needed to 
bind to p53 to prevent apoptosis. In p53-deficient tumor cells, E1b-55K binding 
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to p53 is unnecessary. Thus, deletion of E1b-55KD should restrict vector 
replication to p53-deficient tumor cells. 

Another approach is to use tumor-selective promoters to control the 
expression of early viral genes required for replication (see, e.g., International 
5 PCT application Nos. WO 96/17053 and WO 99/25860). Thus, in this approach 
the adenoviruses selectively replicate and lyse tumor cells if the gene that is 
essential for replication is under the control of a promoter or other transcriptional 
regulatory element that is tumor-selective. 

For example oncolytic adenoviral vectors that contain a cancer selective 

10 regulatory region operatively linked to an adenoviral gene essential for adenoviral 
replication are known (see, e.g., U.S. Patent No. 5,998,205). Adenoviral genes 
essential for replication include, but are not limited to, E1a, E1b, E2a, E2b and 
E4. For example, an exemplary oncolytic adenoviral vector has a cancer 
selective regulatory region operatively linked to the E1a gene. In other 

15 embodiments, the oncolytic adenoviral vector has a cancer selective regulatory 
region of the present invention operatively linked to the E1a gene and a second 
cancer selective regulatory region operatively linked to the E4 gene. The vectors 
also can include at least one therapeutic transgene, such as, but not limited to, a 
polynucleotide encoding a cytokine such as GM-CSF that can stimulate a 

20 systemic immune response against tumor cells. 

Other exemplary oncolytic adenoviral vectors include those in which 
expression of an adenoviral gene, which is essential for replication, is controlled 
by E2F-responsive promoters, which are selectively transactivated in cancer 
cells. Thus, vectors that contains an adenoviral nucleic acid backbone that 

25 contains in sequential order: A left ITR, an adenoviral packaging signal, a 
termination signal sequence, an E2F responsive promoter which is operably 
linked to a first gene, such as E1a, essential for replication of the recombinant 
viral vector and a right ITR (see, published International PCT application No. 
WO02/06786, and U.S. Patent No. 5,998,205). 

30 In other embodiments, the oncolytic adenoviral vector has a cancer 

selective regulatory region operatively linked to the E1a gene and a second 
cancer selective regulatory region operatively linked to the E4 gene. The vectors 
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can also carry at least one therapeutic transgene, such as, but not limited to, a 
polynucleotide encoding a cytokine such as GM-CSF that can stimulate a 
systemic immune response against tumor cells. 
3. Packaging 

5 The viral particles provided herein can be made by any method known to 

those of skill in the art. Generally they are prepared by growing the adenovirus 
vector that contains nucleic acid that encodes the modified fiber protein in a 
standard adenovirus packaging cells to produce particles that express the 
modified fibers. Alternatively, the vectors do not encode fibers. Such vectors 
10 are packaged in producer cells to produce particles that express the modified 
fiber proteins. 

As discussed, recombinant adenoviral vectors generally have at least a 
deletion in the first viral early gene region, referred to as E1 , which includes the 
E1a and E1b regions. Deletion of the viral E1 region renders the recombinant 

15 adenovirus defective for replication and incapable of producing infectious viral 
particles in subsequently-infected target cells. Thus, to generate E1 -deleted 
adenovirus genome replication and to produce virus particles requires a system 
of complementation which provides the missing E1 gene product. E1 
complementation is typically provided by a cell line expressing E1, such as the 

20 human embryonic kidney packaging cell line, i.e. an epithelial cell line, called 
293. Cell line 293 contains the E1 region of adenovirus, which provides E1 
gene region products to "support" the growth of E1 -deleted virus in the cell line 
(see, e.g., Graham et al., J. Gen. Virol. 36: 59-71, 1977). Additionally, cell lines 
that may be usable for production of defective adenovirus having a portion of 

25 the adenovirus E4 region have been reported (WO 96/22378). Multiply deficient 
adenoviral vectors and complementing cell lines have also been described (WO 
95/34671, U.S. Patent No. 5,994,106). 

For example, copending U.S. application Serial No. 09/482,682 (also 
filed as International PCT application No. PCT/EPOO/00265, filed January 14, 

30 200, published as International PCT application No. WO/0042208) provides 

packaging cell lines that support viral vectors with deletions of major portions of 
the viral genome, without the need for helper viruses and also provides cell lines 
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and helper viruses for use with helper-dependent vectors. The packaging cell 
line has heterologous DNA stably integrated into the chromosomes of the cellular 
genome. The heterologous DNA sequence encodes one or more adenovirus 
regulatory and/or structural polypeptides that complement the genes deleted or 
5 mutated in the adenovirus vector genome to be replicated and packaged. 
Packaging cell lines express, for example, one or more adenovirus structural 
proteins, polypeptides, or fragments thereof, such as penton base, hexon, fiber, 
polypeptide Ilia, polypeptide V, polypeptide VI, polypeptide VII, polypeptide VIII, 
and biologically active fragments thereof. The expression can be constitutive or 

10 under the control of a regulatable promoter. These cell lines are particularly 
designed for expression of recombinant adenoviruses intended for delivery of 
therapeutic products. For use herein, such packaging cell lines can express the 
modified capsid proteins, such as the fiber proteins who binding to HSP is 
reduced or eliminated, and/or the modified penton and hexon proteins. 

15 Particular packaging cell lines complement viral vectors having a deletion 

or mutation of a DNA sequence encoding an adenovirus structural protein, 
regulatory polypeptides E1 A and E1B, and/or one or more of the following 
regulatory proteins or polypeptides: E2A, E2B, E3, E4, L4, or fragments thereof. 
The packaging cell lines are produced by introducing each DNA molecule 

20 into the cells and then into the genome via a separate complementing plasmid or 
plurality of DNA molecules encoding the complementing proteins can be 
introduced via a single complementing plasmid. Of interest herein, is a variation 
in which the complementing plasmid includes DNA encoding adenovirus fiber 
protein (or a chimeric or modified variant thereof), from Ad virus of subgroup D, 

25 such as Ad 37, polypeptide or fragment thereof. 

For applications, such as therapeutic applications, the delivery plasmid 
further can include a nucleotide sequence encoding a heterologous polypeptide. 
Exemplary delivery plasmids include, but are not limited to, pDV44, pAE1B£-gal 
and pAE1sp1B. In a similar or analogous manner, therapeutic nucleic acids, 

30 such as nucleic acids that encode therapeutic genes, can be introduced. 
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The cell further includes a complementing plasmid encoding a fiber as 
contemplated herein; the plasmid or portion thereof is integrated into a 
chromosome(s) of the cellular genome of the cell. 

Typically, the packaging cell lines will contain nucleic acid encoding the 
5 fiber protein or modified protein stably integrated into a chromosome or 

chromosomes in the cellular genome. The packaging cell line can be derived from 
a procaryotic cell line or from a eukaryotic cell line. While various embodiments 
suggest the use of mammalian cells, and more particularly, epithelial cell lines, a 
variety of other, non-epithelial cell lines are used in various embodiments. Thus, 
10 while various embodiments disclose the use of a cell line selected from among 
the 293, A549, W1 62, HeLa, Vero, 21 1 , and 21 1 A cell lines, and any other cell 
lines suitable for such use are likewise contemplated herein. 
D. Addition of a targeting ligand 

The viral particles that are detargeted as described herein, can be 
15 retargeted to selected cells and/or tissues by inclusion of an appropriate 

targeting ligand in the capsid. The ligand cam be included in any of the capsid 
proteins, such as fiber, hexon and penton. Loci for inclusion of nucleic acid 
encoding a is known to those of skill in the art for a a variety of adenovirus 
serotypes; if necessary appropriate loci and other parameters can be empirically 
20 determined. 

The ligand can be produced as a fusion by inclusion of the coding 
sequences in the nucleic acid encoding a capsid protein, or chemically 
conjugated, such as via ionic, covalent or other interactions, to the capsid or 
bound to the capsid (e.g., by Ab-ligand fusion, where Ab binds capsid protein; or 
25 by disulfide bonding or other crosslinking moieties or chemistries). 

Thus, for example, a modified fiber nucleic acid also can include 
sequences of nucleotides that encode a targeting ligand to produce viral particles 
that include a targeting ligand in the capsid. Targeting ligand and methods for 
including such ligands in viral capids are well known. For example, inclusion of 
30 targeting ligands in fiber proteins is described in U.S. Patent Nos. 5,543,328 and 
5,756,086 and in U.S. Patent Application Serial No. 09/870,203, published as 
U.S. Published application No. 20020137213, and International Patent 
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Application No. PCT/EP01 /06286. For different serotypes and strains of 
adenoviruses, loci for insertion of targeting ligands can be empirically 
determined. For different serotypes and strains, such loci can vary. 

Because the adenovirus fiber has a trimeric structure, the ligand can be 
5 selected or designed to have a trimeric structure so that up to three molecules of 
the ligand are present for each mature fiber. Such ligands can be incorporated 
into the fiber protein using methods known in the art (see, e.g., U.S. Patent No. 
5,756,086). Instead of the fiber, the targeting ligand can be included in the 
penton or hexon proteins. Inclusion of targeting ligands in penton (see for 

10 example, in U.S. Patent Nos. 5,731,190 and 5,965,431) and in hexon (see for 
example, in U.S. Patent No. 5,965,541) is known. 

In one exemplary embodiment, the ligand is included in a fiber protein, 
which is a fiber protein mutated as described herein. As shown herein, the 
targeting ligand can be included, for example, within the HI loop of the fiber 

15 protein. Any ligand that can fit in the HI loop and still provide a functional virus 
is contemplated herein. Such ligands can be as long as or longer than 80-100 
amino acids (see, e.g., Belousova eta/. (2002) J. Virol. 76:8621-8631). Such 
ligands are added by techniques known in the art (see, e.g., published 
International Patent Application publication No. W099/39734 and U.S. Patent 

20 Application number 09/482,682). Other ligands can be be discovered through 
techniques known to those skilled in the art. Some non-limiting examples of 
these techniques include phage display libraries or by screening other types of 
libraries. 

Targeting ligands include any chemical moiety that preferentially directs 
25 an adenoviral particle to a desired cell type and/or tissue. The categories of such 
ligands include, but are not limited to, peptides, polypeptides, single chain 
antibodies, and multimeric proteins. Specific ligands include the TNF superfamily 
of ligands which include tumor necrosis factors (or TNF's) such as, for example, 
TNFar and TNF/?, lymphotoxins (LT), such as LT-a and LT->ff, Fas ligand which 
30 binds to Fas antigen; CD40 ligand, which binds to the CD40 receptor of 

B-lymphocytes; CD30 ligand, which binds to the CD30 receptor of neoplastic 
cells of Hodgkin's lymphoma; CD27 ligand, NGF ligand, and OX-40 ligand; 
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transferrin, which binds to the transferrin receptor located on tumor cells, 
activated T -cells, and neural tissue cells; ApoB, which binds to the LDL receptor 
of liver cells; alpha-2-macroglobulin, which binds to the LRP receptor of liver 
cells; alpha-l acid glycoprotein, which binds to the asialoglycoprotein receptor of 
5 liver; mannose-containing peptides, which bind to the mannose receptor of 
macrophages; sialyl-Lewis-X antigen-containing peptides, which bind to the 
ELAM-I receptor of activated endothelial cells; CD34 ligand, which binds to the 
CD34 receptor of hematopoietic progenitor cells; ICAM-I, which binds to the 
LFA-I (CD11b/CD18) receptor of lymphocytes, or to the Mac-I (CD11a/CD18) 

10 receptor of macrophages; M-CSF, which binds to the c-fms receptor of spleen 
and bone marrow macrophages; circumsporozoite protein, which binds to 
hepatic Plasmodium falciparum receptor of liver cells; VLA-4, which binds to the 
VCAM-I receptor of activated endothelial cells; HIV gp120 and Class II MHC 
antigen, which bind to the CD4 receptor of T -helper cells; the LDL receptor 

15 binding region of the apolipoprotein E (ApoE) molecule; colony stimulating factor, 
or CSF, which binds to the CSF receptor; insulin-like growth factors, such as 
IGF-I and IGF-II, which bind to the IGF-I and IGF-II receptors, respectively; 
Interleukins 1 through 14, which bind to the Interleukin 1 through 14 receptors, 
respectively; the Fv antigen-binding domain of an immunoglobulin; gelatinase 

20 (IS/IMP) inhibitor; bombesin, gastrin-releasing peptide; substance P; somatostatin; 
luteinizing hormone releasing hormone (LHRH); vasoactive peptide (VIP); gastrin; 
melanocyte stimulating hormone (MSH); cyclic RGD peptide and any other ligand 
or cell surface protein-binding (or targeting) molecule. 
E. Heterologous polynucleotides and Therapeutic Nucleic Acids 

25 The packaged adenoviral genome also can contain a heterologous 

polynucleotide that encodes a product of interest, such as a therapeutic protein. 
Adenoviral genomes containing heterologous polynucleotides are well known 
(see, e.g., U.S. Patent Nos. 5,998,205, 6,156,497, 5,935,935, and 
5,801 ,029). These can be used for in vitro and in vivo delivery of the products 

30 of heterlogous polynucleoties or the heterologous polynucleotides. 

Thus, the adenoviral particles provided herein can be used to engineer a 
cell to express a protein that it otherwise does not express or does not express 
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in sufficient quantities. This genetic engineering is accomplished by infecting the 
desired cell with an adenoviral particle whose genome includes a desired 
heterologous polynucleotide. The heterologous polynucleotide is then expressed 
in the genetically engineered cells. For use herein the cell is generally a 
5 mammalian cell, and is typically a primate cell, including a human cell. The cell 
can be inside the body of the animal {in vivo) or outside the body [in vitro). 
Heterologous polynucleotides (also referred to as heterologous nucleic acid 
sequences) are included in the adenoviral genome within the particle and are 
added to that genome by techniques known in-the art. Any heterologous 

10 polynucleotide of interest can be added, such as those disclosed in U.S. Patent 
No. 5,998,205, incorporated herein by reference. Polynucleotides that are 

introduced into an Ad genome or vector can be any that encode a protein of 
interest or that are regulatory sequences. Proteins include, but are not limited 
to, therapeutic proteins, such as an immunostimulating protein, such as an 

15 interleukin, interferon, or colony stimulating factor, such as granulocyte 
macrophage colony stimulating factor (GM-CSF; see, e.g., 5,908,763F. 
Generally, such GM-CSF is a primate GM-CSF, including human GM-CSF. Other 
immunostimulatory genes include, but are not limited to, genes that encode 
cytokines IL1, IL2, IL4, IL5, IFN, IFN, TNF, IL12, IL18, and flt3), proteins that 

20 stimulate interactions with immune cells (B7, CD28, MHC class I, MHC class II, 
TAPs), tumor-associated antigens (immunogenic sequences from MART-1 , 
gp100(pmel-17), tyrosinase, tyrosinase-related protein 1 , tyrosinase-related 
protein 2, melanocyte-stimulating hormone receptor, MAGE1, MAGE2, MAGE3, 
MAGE12, BAGE, GAGE, NY-ESO-1, -catenin, MUM-1, CDK-4, caspase 8, KIA 

25 0205, HLA-A2R1701, -fetoprotein, telomerase catalytic protein, G-250, MUC-1, 
carcinoembryonic protein, p53, Her2/neu, triosephosphate isomerase, CDC-27, 
LDLR-FUT, telomerase reverse transcriptase, and PSMA), cDNAs of antibodies 
that block inhibitory signals (CTLA4 blockade), chemokines (MIP1, MIP3, CCR7 
ligand, and calreticulin), and other proteins. 

30 Other polynucleotides, including therapeutic nucleic acids, such as 

therapeutic genes, of interest include, but are not limited to, anti-angiogenic, and 
suicide genes. Anti-angiogenic genes include, but are not limited to, genes that 
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encode METH-1, METH -2, TrpRS fragments, proliferin-related protein, prolactin 
fragment, PEDF, vasostatin, various fragments of extracellular matrix proteins 
and growth factor/cytokine inhibitors. Various fragments of extracellular matrix 
proteins include, but are not limited to, angiostatin, endostatin, kininostatin, 
5 fibrinogen-E fragment, thrombospondin, tumstatin, canstatin, and restin. Growth 
factor/cytokine inhibitors include, but are not limited to, VEGF/VEGFR 
antagonist, sFlt-1 , sFlk, sNRP1 , angiopoietin/tie antagonist, sTie-2, chemokines 
(IP-10, PF-4, Gro-beta, IFN-gamma (Mig), IFN, FGF/FGFR antagonist (sFGFR), 
Ephrin/Eph antagonist (sEphB4 and sephrinB2), PDGF, TGF and IGF-1 . 

10 A "suicide gene" encodes a protein that can lead to cell death, as with 

expression of diphtheria toxin A, or the expression of the protein can render cells 
selectively sensitive to certain drugs, e.g., expression of the Herpes simplex 
thymidine kinase gene (HSV-TK) renders cells sensitive to antiviral compounds, 
such as acyclovir, gancyclovir and FIAU (1-(2-deoxy-2-fluoro-.beta.- 

15 D-arabinofuranosil)-5-iodouracil). Other suicide genes include, but are not limited 
to, genes that encode carboxypeptidase G2 (CPG2), carboxylesterase (CA), 
cytosine deaminase (CD), cytochrome P450 (cyt-450), deoxycytidine kinase 
(dCK), nitroreductase (NR), purine nucleoside phosphorylase (PNP), thymidine 
phosphorylase (TP), varicella zoster virus thymidine kinase (VZV-TK), and 

20 xanthine-guanine phosphoribosyl transferase (XGPRT). Alternatively, a 

therapeutic nucleic acid can exert its effect at the level of RNA, for instance, by 
encoding an antisense message or ribozyme, a protein that affects splicing or 3' 
processing (e.g., polyadenylation), or a protein that affects the level of 
expression of another gene within the cell, e.g. by mediating an altered rate of 

25 mRNA accumulation, an alteration of mRNA transport, and/or a change in 

post-transcriptional regulation. The addition of a therapeutic nucleic acid to a 
virus results in a virus with an additional antitumor mechanism of action. Thus, 
a single entity (i.e.. the virus carrying a therapeutic transgene) is capable of 
inducing multiple antitumor mechanisms. Other encoded proteins, include, but 

30 are not limited to, herpes simplex virus thymidine kinase (HSV-TK), which is 
useful as a safety switch (see, U.S. Patent Application No. 08/974,391 , filed 
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November 19, 1997, which published as PCT Publication No. WO/9925860), 

Nos, FasL, and sFasR (soluble Fas receptor). 

Also contemplated are combinations of two or more transgenes with 

synergistic, complementary and/or nonoverlapping toxicities and methods of 
5 action. The resulting adenovirus can retain the viral oncolytic functions and, for 

example, additionally are endowed with the ability to induce immune and 

anti-angiogenic responses and other responses as desired. 

Therapeutic polynucleotides and heterologous polynucleotides also 

include those that exert an effect at the level of RNA or protein. These include 
10 include a factor capable of initiating apoptosis, RNA, such as RNAi and other 

double-stranded RNA, antisense and ribozymes, which among other capabilities 

can be directed to mRNAs encoding proteins essential for proliferation, such as 

structural proteins, transcription factors, polymerases, genes encoding cytotoxic 

proteins, genes that encode an engineered cytoplasmic variant of a nuclease 
15 (e.g. RNase A) or protease (e.g. trypsin, papain, proteinase K and carboxy- 

peptidase). Other polynucleotides include a cell or tissue specific promoters, 

such as those used in oncolytic adenoviruses (see, e.g., U.S. Patent No. 

5,998,205). 

The heterologous polynucleotide encoding a polypeptide also can contain 
20 a promoter operably linked to the coding region. Generally the promoter is a 
regulated promoter and transcription factor expression system, such as the 
published tetracycline-regulated systems, or other regulatable systems (WO 
01/30843), to allow regulated expression of the encoded polypeptide. 
Exemplary of other promoters, are tissue-selective promoters, such as those 
25 described in U.S. Patent No. 5,998,205. An exemplary regulatable promoter 
system is the Tet-On( and Tet-Off ( systems currently available from Clontech 
(Palo Alto, CA). This promoter system allows the regulated expression of the 
transgene controlled by tetracycline or tetracycline derivatives, such as 
doxycyciine. This system can be used to control the expression of the encoded 
30 polypeptide in the viral particles and nucleic acids provided herein. Other 
regulatable promoter systems are known (see, e.g., published U.S. No. 
20020168714, entitled "Regulation of Gene Expression Using Single-Chain, 
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Monomeric, Ligancl Dependent Polypeptide Switches," which describes gene 
switches that contain ligand binding domains and transcriptional regulating 
domains, such as those from hormone receptors). Other suitable promoters that 
can be employed include, but are not limited to, adenoviral promoters, such as 
5 the adenoviral major late promoter and/or the E3 promoter; or heterologous 
promoters, such as the cytomegalovirus (CMV) promoter; the Rous Sarcoma 
Virus (RSV) promoter; inducible promoters, such as the MMT promoter, the 
metallothionein promoter; heat shock promoters; the albumin promoter; and the 
ApoAl promoter. 

10 Therapeutic transgenes can be included in the viral constructs and 

resulting particles. Among these are those that result in an "armed" virus. For 
example, rather than delete E3 region as in some embodiments described herein, 
all or a part of the E3 region can be preserved or re-inserted in an oncolytic 
adenoviral vector (discussed above). The presence of all or a part of the E3 

15 region can decrease the immunogenicity of the adenoviral vector. It also 

increases cytopathic effect in tumor cells and decreases toxicity to normal cells. 
Typically such vector expresses more than half of the E3 proteins. 

Adenoviruses for therapy, including those for human therapy, are known. 
Such known viruses can be modified as provided herein to reduce or eliminate 

20 interaction with HSPs and optionally additional receptors. The adenoviral vectors 
that are used to produce the viral particles can include other modifications. 
Modifications include modifications to the adenovirus genome that is packaged 
in the particle in order to make an adenoviral vector. As discussed above, 
adenovirus vectors and particles with a variety of modifications are available. 

25 Modifications to adenvoiral vectors include deletions known in the art, such as 
deletions in one or more of the El, E2a, E2b, E3, or E4 coding regions. These 
adenoviruses are sometimes referred to as early generation adenoviruses, include 
those with deletions of all of the coding regions of the adenoviral genome 
("gutless" adenoviruses, discussed above) and also include replication-condi- 

30 tional adenoviruses, which are viruses that replicate in certain types of cells or 

tissues but not in other types as a result of placing adenoviral genes essential for 
replication under control of a heterologous promoter (discussed above; see, also 
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U.S. Patent No. 5,998,205, U.S. Patent No. 5,801,029; U.S. patent application 
60/348,670 and corresponding published International PCT application No. 
WO02/O6786). These include the cytolytic, cytopathic viruses (or vectors), 
including the oncolytic viruses discussed above. 
5 Alternatively, as discussed above, the vector can include a mutation or 

deletion in the E1b gene. Typically such mutation or deletion in the E1b gene is 
such that the E1b-19kD protein becomes non-functional. This modification of 
the E1b region can be combined with vectors where all or a part of the E3 region 
is present. 

10 The oncolytic adenoviral vector can further include at least one 

heterologous coding sequence, such as one that encodes a therapeutic product. 
The heterologous coding sequence, such as therapeutic gene, is generally, 
although not necessarily, in the form of cDNA, and can be inserted at any locus 
that does not adversely affect the infectivity or replication of the vector. For 

15 example, it can be inserted in an E3 region in place of at least one of the 

polynucleotide sequences that encode an E3 protein, such as, for example, the 
19kD or 14.7 kD E3 gene. 
F. Propagation and Scale-up 

Since doubly ablated adenoviral vectors containing mutations in the fiber 

20 and/or penton capsid proteins result in inefficient cell binding and entry via the 
CAR/av integrin entry pathway, scaled up technologies improve the growth and 
propagation of such vectors to produce high titers of the adenoviral vectors for 
clinical use. Thus, also provided is a method for scaling up the production of 
detargeted adenoviral vectors. The detargeted adenoviral vectors comprise an 

25 adenoviral vector modified to ablate the interaction of said vector with at least 
one host cell receptor compared with a wild-type adenoviral vector. The 
detargeted adenoviral vectors can comprise an adenoviral vector modified to 
ablate the interaction of said vector with one, two, three or more host cell 
receptors. Thus, the method is suitable for producing the detargeted adenoviral 

30 vectors disclosed herein. 

As noted, growth and propagation of doubly and fully ablated adenoviral 
vectors is enhanced by new scale up technologies. Doubly ablated vectors 
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contain mutations in the fiber and penton capsid proteins that result in inefficient 
cell binding and entry via the normal cellular entry pathway using CAR and 
integrins. These vectors are fully detargeted in vitro and, thus, alternative 
cellular entry strategies allow for the efficient growth and generation of high titer 

5 preparations. 

Two strategies have been envisioned to scale up vectors that are 
detargeted via fiber and/or penton modifications. These include: (a) the use of 
pseudoreceptor cell lines engineered to express a surface receptor that binds a 
ligand displayed on the vector (see. e.g.. International PCT application No. WO 

10 98/54346) and (b) complementing cell lines that are engineered to express 

native fiber and that can be engineered to express native fiber and penton (see, 
e.g., International PCT application No. WO 00/42208). Although these systems 
have shown promise for scaling up ablated adenoviral vectors, there is a need to 
develop a system for the simple, efficient production of the fully detargeted 

15 adenoviral vector for therapeutic uses. 

Provided herein is a scale-up method for the propagation of detargeted 
adenoviral vectors. The method uses polycations and/or bifunctional reagents, 
which when added to tissue culture medium, bind adenoviral particles and direct 
their entry into the producer cells. 

20 Reagents (also called medium additives) also can be included in the tissue 

culture medium containing producer cells to be infected with the detargeted 
adenoviral vectors. Alternatively the reagents can be pre-mixed with the virus, 
which mixture is then added to the tissue producer cells. The reagents can be 
added to tissue culture medium containing producer cells, or producer cells can 

25 be added to tissue culture medium containing the reagents. Any suitable 

producer cell known to the skilled artisan can be used in the present methods. 
The reagents can be added at the same time that the producer cells are infected 
with detargeted adenoviral vectors. Generally the reagents are present in the 
tissue culture medium prior to infection by the detargeted adenoviral vectors. 

30 The medium additives are maintained in the tissue culture medium during vector 
growth, spread and propagation. High titer yields of adenoviral vectors are 
obtained by this method. 
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Reagents which are useful in this method are those that are capable of 
directing adenoviral particle entry into the producer cells. Such reagents include, 
but are not limited to, polycations and Afunctional reagents. Suitable 
polycations include, but are not limited to, polytheylenimine; protamine sulfate; 
5 poly-L-lysine hydrobromide; poly(dimethyl diallyl ammonium) chloride 

(Merquat(r)-IOO, Merquat(r)280, Merquat(r)550); poIy-L-arginine hydrochloride; 
poly-L-histidine; poly(4-vinylpyridine), poly(4-vinylpyridine) hydrochloride; 
poly(4-vinyIpyridine)cross-linked, methylchloride quaternary salt; po!y(4-vinyl- 
: pyridine-co-styrene); poly(4-vinylpyridinium poly(hydrogen fluoride)); poly(4-vinyl- 

10 pyridinium-P-toluene sulfonate); poly(4-vinyIpyridinium-tribromide); poly(4-vinyl- 
pyrrolidone-co-2-dimethyiamino-ethyl methacrylate); polyvinylpyrrolidone, 
cross-linked; poly vinylpyrrolidone, poly<melamine-co-formaldehyde); partially 
methylated; hexadimethrine bromide; poly(Glu, Lys) 1 :4 hydrobromide; poly(Lys, 
Ala) 3:1 hydrobromide; po!y(Lys, Ala) 2:1 hydrobromide; poly-L-lysine 

15 succinylated; poly{Lys, Ala) 1:1 hydrobromide; and poly (Lys, Trp) 1:4 
hydrobromide. 

Suitable bifunctional reagents include, but are not limited to, antibodies or 
peptides that bind to the adenoviral capsid and that also contain a ligand that 
allows interaction with specific cell surface receptors of the producer cells. 
20 Examples of bifunctional reagents include: (a) anti-fiber antibody ligand fusions, 
(b) anti-fiber-Fab-FGF conjugate, (c) anti-penton-antibody ligand fusions, (d) 
anti-hexon antibody ligand fusions and <e) polylysine-peptide fusions. The ligand 
is any ligand that will bind to any cell surface receptor found on the producer 
cells. 

25 

The following examples are included for illustrative purposes only and are 
not intended to limit the scope of the invention. 
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EXAMPLE 1 

Construction of Ad5 Vectors Containing the Fiber AB Loop, KOI and Penton, 
PD1 Mutations and Derivatives Thereof 

Three recombinant adenoviral vectors were prepared that contain the KOI 
5 fiber or PD1 penton base mutations either alone or in combination, these vectors 
are designated Av3nBgFK01 Av1nBgPD1, and Av1 nBgFKOI PD1 . Construction 
of these vectors is described below and a general description of each vector is 
set forth in Table 1 . 

TABLE 1 

10 Description Of Detargeted 

Recombinant Adenoviral Vectors Used For Scale-up 

Vector 



Vector 


Description 


Av3nBg 


An E1, E2a, E3-deleted adenoviral vector encoding a nuclear 
localizing /?-ga1actosidase 


AvInBg 


An E1 and E3-deleted adenoviral vector encoding a nuclear 
localizing j£-galactosidase 


Av3nBgFK01 


The same as Av3nBg but containing the K01 mutation in the 
fiber gene 


AvInBgPDI 


The same as AvInBg but containing the PD1 mutation in the 
penton gene 


Av1 nBgFKOI PD1 


The same as AvInBg but containing the fiber KOI and penton 
PD1 mutations 

i ■ ■ 1 



20 AvInBg 

This is a well-known vector, its sequence is set forth in SEQ ID No. 43. 
Av3nBg 

This is a well-known vector, its sequence is set forth in SEQ ID No. 44. 
Av3nBgFK01 

25 Genetic incorporation of the KOI fiber mutation to generate Av3nBgFK0 1 

The adenoviral vector Av3nBgFK01 was generated in an E1-, E2a-, 
E3-deleted backbone based on the adenovirus serotype 5 genome. It contains a 
RSV promoted nuclear-localizing yff-galactosidase gene in place of the E1 region. 
In addition, the fiber gene carries the K01 mutation. This mutation results in a 
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substitution of fiber amino acids 408 and 409, changing them from serine and 
proline to glutamic acid and alanine, respectively. 

The vector was constructed as follows. First, the plasmid pSKOI (Figure 
1) was digested with the restriction enzymes Sphl and Muni. The resulting DNA 
5 fragments were separated by electrophoresis on an agarose gel. The 1601 bp 
fragment containing all but the 5' end of the fiber gene was excised from the 
agarose gel and the DNA was isolated and purified. The fragment was then 
ligated with the 9236 bp fragment of p5FloxHRFRGD, which had been digested 
with Sphl and Muni. The resulting plasmid, pBFIoxHRFKOI , was digested with 

10 ( Spel and Pad and the 6867 bp fragment containing the fiber gene was isolated. 
The fragment was ligated with the 24,630 bp Spel-Pacl fragment of pNDSQ3.1 . 
The resulting plasmid, pNDSQ3.1K01 (Figure 2), was used together with 
pAdmireRSVnBg (Figure 3A) to generate a plasmid which encodes the full-length 
adenoviral vector genome. It, however, was necessary to remove the Pad site 

15 from pNDSQ3.1K01 (Figure 2) prior to recombination with pAdmireRSVnBg 

(Figure 3A) so that the final plasmid contains a unique Pad site adjacent to the 
5' ITR. The Pad site in pNDSQ3.1K01 was removed by digestion with Pad 
followed by blunting with T4 DNA Polymerase and religation. The resulting 
plasmid was called pNDSQ3.1 KOI (Pac. 

20 To generate a full-length plasmid containing the entire adenoviral genome, 

pAdmireRSVnBg (Figure 3A) was digested with Sail and co-transfected into 
competent cells of the E. coli strain BJ5183 along with pNDSQ3.1K01 APac, 
which had been digested with BstBI. Homologous recombination between the 
two plasmids generated a full-length plasmid encoding the entire adenoviral 

25 vector genome, which was called pFLAv3nBgFK01 . 

The plasmid pFLAv3nBgK01 was linearized with Pad and transfected into 
633 cells. In the fiber complementing 633 cell line, the resulting viral DNA 
containing the KOI mutation is capable of being packaged into infectious viral 
particles containing a mixture of wildtype fiber and mutant fiber proteins. After 

30 five rounds of amplification in 633 cells, a cytopathic effect was observed. 
Three more rounds of amplification in 633 cells were performed followed by 
purification of the virus by standard CsCI centrifugation procedures. This viral 
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preparation was used to infect AE1-2a cells, which do not express fiber. The 
resulting virus contained only the mutant fiber protein on its capsid. Virus 
particles were purified by standard CsCI centrifugation procedures. 
Av1nBgFK01 

5 The v ector AvInBgFKOI is made in a similar manner to Av3nBgFK01 

described above. 
Av1nBgK012 

An additional fiber AB loop mutation (described by Einfeld et al. (2001) J: 
Virology 75'A 1284-1 1291) was incorporated into the genome of AvInBg. This 

10 AB loop mutation is a four amino acid substitution, R512S, A515G, E516G, and 
K517G, and is referred to as K012. The K012 mutation was incorporated into 
the fiber gene by PCR gene overlap extension using the plasmid pSQ1 (Figure 
3B) as template. The pSQ1 plasmid contains most of the Ad5 genome, 
extending from base pair 3329 through the right ITR, in a pBR322 backbone. 

15 First, a segment of the Ad5 genome extending from within the E3 region into the 
fiber gene was amplified by PCR using the plasmid pSQ1 as a template with the 
following primers termed 5FF, 5'-GAA CAG GAG GTG AGC TTA GA-3' SEQ ID 
No. 4), and 5FR, 5'-TCC GCC TCC ATT TAG TGA ACA GTT AGG AGA TGG 
AGC TGG TGT G-3' (SEQ ID No. 6). The primer 5FR contains an 18 base 

20 5'-extension that encodes the modified fiber AB loop amino acids from 512 
through 517. A second PCR using pSQ1 as a template amplified the region 
immediately 3' of the AB loop substitution and extending past the Muni site 
located 40 base pairs 3' of the fiber gene stop codon. The two primers used for 
this reaction were 3FF: 5'-TCA CTA AAT GGA GGC GGA GAT GCT AAA CTC 

25 ACT TTG GTC TTA AC-3' (SEQ ID No. 7), and 3FR: 5'-GTG GCA GGT TGA 
ATA CTA GG-3' (SEQ ID No.8). The primer 3FR contains an 18 base 
5'. extens j on that encodes the modified fiber AB loop amino acids 512 through 
517. Amplified products of the expected size were obtained and used in a 
second PCR with the end primers 5FF and 3FR to join the fragments together. 

30 The K012 PCR fragment was digested with Xbal and Muni cloned directly into 
the fiber shuttle plasmid, pFBshuttle(EcoRI) to generate the plasmid pFBSEK012 
which contains the 8.8kB EcoRI fragment of pSQ1 . The pFBSEKQ12 plasmid 
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was digested with Xbal and EcoRI and cloned into pSQ1 using a three-way 
ligation to generate pSQ1K012 (Figure 3C). The K012 cDNA was incorporated 
into the genome of AvInBg, an adenovirus vector with E1 and E3 deleted 
encoding /?-galactosidase, by homologous recombination between Clal-linearized 
5 pSQ1K012 and pAdmireRSVnBg digested with Sail and Pad to generate 
Av1nBgK012. The K012 vector was transfected in 633 cells, scaled-up on 
non-fiber expressing cells and purified, as described above for KOI. 
Av1nBgPD1 

Genetic incorporation of the PD1 penton mutation to generate 
10 AvInBgPDI 

The adenoviral vector AvInBgPDI is an E1-, E3-deleted vector based on 

the adenovirus serotype 5 genome. It contains a RSV promoted 

nuclear-localizing jff-galactosidase gene in the E1 region and also contains the 

PD1 mutation in the penton gene. The PD1 mutation results in a substitution of 

15 amino acids 337 through 344 of the penton protein, HAIRGDTF (SEQ ID No. 9), 
with amino acids SRGYPYDVPDYAGTS (SEQ ID No. 10), thus replacing the RGD 
tripeptide (see, Einfeld et al. (2001 ) J. Virology 75:11284-11291). The 
mutation in the penton gene was generated in the plasmid pGEMpenB, which 
contains the Adenovirus serotype 5 penton gene. To generate the mutation, 

20 four oligonucleotides were synthesized. The sequences of the oligonucleotides 
were as follows: penton 1 : 5' CGC GGA AGA GAA CTC CAA CGC GGC AGC 
CGC GGC AAT GCA GCC GGT GGA GGA CAT GAA 3' (SEQ ID No. 11); penton 
2: 5' TAT CGT TCA TGT CCT CCA CCG GCT GCA TTG CCG CGG CTG CCG 
CGT TGG AGT TCT CTT CC 3' (SEQ ID No. 1 2); penton 3: 5' CGA TAG CCG 

25 CGG CTA CCC CTA CGA CGT GCC CGA CTA CGC GGG CAC CAG CGC CAC 
ACG GGC TGA GGA GAA GCG CGC 3' (SEQ ID No. 13); penton 4: 5' TCA GCG 
CGC TTC TCC TCA GCC CGT GTG GCG CTG GTG CCC GCG TAG TCG GGC 
ACG TCG TAG GGG TAG CCG CGG C 3' (SEQ ID No. 14). The complementary 
oligonucleotides penton 1 and penton 2 were annealed to each other as were 

30 penton 3 and penton 4. The duplex generated by annealing penton 3 and penton 
4 encoded the substitution of amino acids 337 through 344 described above. 
The duplex generated by annealing penton 1 and penton 2 possessed a 5 base 
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5' overhang which was compatible to a 5 base 5' overhang on the duplex 
generated by annealing penton 3 and penton 4. The opposite end of the duplex 
generated by annealing penton 1 and penton 2 contained an Earl compatible 
overhang. The opposite end of the duplex generated by annealing penton 3 and 
5 penton 4 contained a BbvCI compatible overhang. The two duplexes were 

ligated to each other and ligated back into the pGEMpen5 backbone as follows. 
First, pGEMpen5 was digested with BbvCI and Pstl and the resulting DNA 
fragments were separated by electrophoresis on an agarose gel. The 3360 bp 
fragment was excised from the gel and purified. The plasmid pGEMpen5 was 
10 also digested with Pstl and Earl and the resulting fragments were separated by 
electrophoresis on an agarose gel. The 955 bp fragment was excised from the 
gel and purified. .These two fragments from the pGEMpen5 plasmid were ligated 
with the two pairs of annealed oligonucleotides to generate the plasmid 
pGEMpen5PD1. 

15 The mutated penton gene was transferred from pGEMpen5PD1 to pSQ1 

using a 5-way ligation as follows. First, the region of the penton gene 
containing the PD1 mutation was excised from pGEMpen5PD1 by digestion with 
Pvul and Ascl. The 974 bp fragment containing the PD1 mutation was purified. 
1 Four DNA fragments were prepared from the pSQ1 plasmid (Figure 3B) as 

20 follows. The plasmid was digested with Csp45l and Fsel and the 9465 bp 

fragment was purified. In addition pSQ1 was digested with Fsel and Pvul and 
the 21 26 bp fragment was purified. The plasmid pSQ1 was digested with Ascl 
and BamHI and the 5891 bp fragment was purified. Finally, pSQI was digested 
with BamHI and Csp45l and the 14610 bp fragment was purified. The 5 purified 

25 DNA fragments were ligated to each other to form the plasmid pSQ1PD1 (Figure 
4). 

To generate adenoviral vector, pSQ1PD1 was linearized by digestion with 
Clal and co-transfected into PerC6 cells with pAdmireRSVnBg (Figure 3A) which 
had been digested with Sail and Pad. hexadimethrine bromide was maintained 
30 in the medium at 4 //g/ml. When a cytopathic effect was observed, a crude viral 
lysate was further expanded on PerC6 cells. The virus was purified by standard 
CsCI centrifugation procedures. 
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Av 1 nBgFK0 1 PD 1 

Genetic incorporation of the fiber KOI or K012 mutation in combination 
with the penton PD1 mutation to generate Av1 nBgFK0 1 PD1 

The adenoviral vectors Av1 nBgFKOI PD1 and Av1 nBgKOI 2PD1 were 

5 generated in an E1-, E3-deleted adenovirus serotype 5 genome. Both vectors 

contains a RSV promoted nuclear-localizing jff-galactosidase gene in the E1 region 

and also contains either the K01 or K012 mutation in the fiber gene as well as 

the PD1 mutation in the penton gene. The vectors were constructed as follows. 

First, the plasmid pSQ1PD1 was digested with Csp45l and Spel and the 23976 

10 bp fragment containing the PD1 mutated penton gene was purified. In addition, 
the plasmids pSQIKOI or pSQ1KO!2 (Figure 3B) were digested with Csp45l 
and Spel and the 9090 bp fragment containing the KOI or K012 mutated fiber 
gene were purified. The appropriate purified fragments were ligated to each 
other to from the plasmid pSQIFKOIPDI (Figure 5A) or pSQ1K012PD1 (Figure 

15 5B) that contains the KOI (or K012) mutated fiber gene and the PD1 mutated 
penton gene. To generate virus, pSQIFKOIPDI orpSQK012PD1 was linearized 
with Clal and co-transfected into 633 cells with pAdmireRSVnBg (Figure 3A) 
which had been digested with Sail and Pad. After three rounds of amplification 
in 633 cells a cytopathic effect was observed and the crude viral lysate was 

20 then amplified on PerC6 cells. Hexadimethrine bromide was maintained in the 
medium at 4 //g/ml. Each virus was purified by standard CsCI centrifugation 
procedures. 

EXAMPLE 2 

In Vitro Evaluation of Adenoviral Vectors Containing the KOI and PD1 Mutations 
25 Several recombinant adenoviral vectors were used in these studies to 

demonstrate the function of the KOI fiber mutation and included AvInBg, 
Avl nBgFKOI , Av1nBgPD1 , and Av1 nBgFKOI PD1 , described above. The 
transduction efficiencies of adenoviral vectors containing the KOI and/or PD1 
mutations were evaluated on cells of the alveolar epithelial cell line A549. The 
30 transduction efficiencies were compared to that of AvInBg, an adenoviral vector 
containing wild type fiber and penton. 
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The day prior to infection, cells were seeded into 24-well plates at a 
density of approximately 1 x 10 5 cells per well. Immediately prior to infection, 
the exact number of cells per well was determined by counting a representative 
well of cells. Each of the vectors, AvInBg, AvInBgFKOI, and AvInBgFKOI PD1 
5 were used to transduce A549 cells at each of the following particle per cell 
(PPC) ratios: 100, 500, 1000, 2500, 5000, 10,000. The cell monolayers were 
stained with X-gal 24 hours after infection and the percentage of cells 
expressing >?-galactosidase was determined by microscopic observation and 
counting of cells. Transductions were done in triplicate and three random fields 
10 in each well were counted, for a total of nine fields per vector. 

The results at the 500 PPC ratio are shown in Figure 6 and show a 
significantly reduced transduction efficiency on A549 cells using vectors 
containing the KOI mutation alone or when combined with PD1 compared to 
AvInBg. The vectors containing the PD1 mutation alone had no effect on 
15 adenoviral transduction of A549 cells in vitro. 

EXAMPLE 3 

in Vivo Analysis of Adenoviral Vectors Containing the FKOI and PD1 Mutations 

This Example provides experiments that evaluate the in vivo 
biodistribution of adenoviral vectors containing the KOI and PD1 mutations and 

20 their influence on adenoviral-mediated liver transduction. The results show that 
ablating the viral interaction with CAR and/or integrins is not sufficient to fully 
detarget adenoviral vectors from the liver in vivo. 

A positive control cohort received AvInBg and a negative control group 
received HBSS. Additionally, the AvInBgFKOI 2 and Av1 nBgFKOI 2PD1 vectors 

25 were analyzed in vivo. These vectors each contain a fiber protein with the four 
amino acid substitution in the AB loop. Additionally, Av1nBgFK012PD1 
contains a mutation in the penton base. Both of these mutations were known 
(see, Einfeld et al. (2001 ) J. Virology 75:1 1 284-1 1 291 ), and were alleged to 
decrease liver transduction 10 to 700 fold, respectively. Cohorts of five 

30 C57BL/6 mice received each vector via tail vein injection at a dose of 1 x 10 1 

particles per kg. The animals were sacrificed approximately 72 hours after vector 
administration by carbon dioxide asphyxiation. Liver, heart, lung, spleen, and 
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kidney were collected from each animal. The median lobe of the liver was placed 
in neutral buffered formalin to preserve the sample for /?-galactosidase 
immunohistochemistry. In addition, tissue from each organ was frozen to 
preserve it for hexon PCR analysis to determine vector content. A separate 
5 sample of liver from each mouse was frozen to preserve it for a 
chemiluminescent /?-galactosidase activity assay. 

For /?-galactosidase immunohistochemistry slices of liver, approximately 
2-3 mm thick, were placed in 10% neutral buffered formalin. After fixation, 
these samples were embedded in paraffin, sectioned, and analyzed by 

10 immunohistochemistry for /?-galactosidase expression. A 1 :1 200 dilution was 
used of a rabbit anti-£-galactosidase antibody (ICN Pharmaceuticals, Inc.; Costa 
Mesa, CA) in conjunction with a Vectastain ABC kit (Vector Laboratories, Inc., 
Burlingame, CA) to visualize positive cells. 

The chemiluminescent /?-galactosidase activity assay was performed 

15 using the Galacto-Light PlusTM chemiluminescent assay (Tropix, Inc., Foster 
City, CA) system. Tissue samples were collected in lysis matrix tubes 
containing two ceramic spheres (Bio101, Carlsbad, CA) and frozen on dry ice. 
The tissues were thawed and 500 jj\ of lysis buffer from the Galacto-Light Plus 
kit was added to each tube. The tissue was homogenized for 30 seconds using a 

20 FastPrep System (BiolOI, Carlsbad, CA). Liver samples were homogenized for 
an additional 30 seconds, jff-galactosidase activity was determined in the liver 
homogenates according to the manufacture's protocol. 

For hexon PCR analysis DNA from tissues was isolated using the Qiagen 
Blood and Cell Culture DNA Midi or Mini Kits (Qiagen Inc., Chatsworth, CA). 

25 Frozen tissues were partially thawed and minced using sterile disposable 

scalpels. Tissues were then lysed by incubation overnight at 55° C in Qiagen 
buffer G2 containing 0.2 mg/ml RNaseA and 0.1 mg/ml protease. Lysates were 
vortexed briefly and then applied to Qiagen-tip 100 or Qiagen-tip 25 columns. 
Columns were washed and DNAs were eluted as described in the manufacturer's 

30 instructions. After precipitation, DNAs were dissolved in water and the 

concentrations were spectrophotometrically determined (A260 and A280) on a 
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DU-600 (Beckman Coulter, Inc.; Fullerton, CA) or a SPECTRAmax PLUS 
(Molecular Devices, Inc.; Sunnyvale, CA) spectrophotometer. 2.3.2. 

PCR primers and a Taqman probe specific to adenovirus hexon sequences 
were designed using Primer Express software v. 1 .0 (Applied Biosystems, Foster 
5 City, CA). Primer and probe sequences were: 

Hexon Forward primer: 5'-CTTCGATGATGCCGCAGTG-3' (SEQ ID No. 38); 
Hexon Reverse primer: 5'-GGGCTCAGGTACTCCGAGG-3' (SEQ ID No. 39); and 
Hexon Probe: 5'-FAM-TTACATGCACATCTCGGGCCAGGAC-TAMRA-3' (SEQ ID No. 

40). 

10 Amplification was performed in a reaction volume of 50 p\ under the 

following conditions: 10 ng (tumor) or 1 /jq (liver and lung) of sample DNA, 1X 
Taqman Universal PCR Master Mix (Applied Biosystems), 600 nM forward 
primer, 900 nM reverse primer and 100 nM hexon probe. Thermal cycling 
conditions were: 2 minute incubation at 50° C, 10 minutes at 95° C, followed by 

15 35 cycles of successive incubation at 95° C for 1 5 seconds and 60° C for 1 
minute. Data was collected and analyzed using the 7700 Sequence Detection 
System software v. 1.6.3 (Applied Biosystems). Quantification of adenovirus 
copy number was performed using a standard curve that includes dilutions of 
adenovirus DNA from 1 ,500,000 copies to 1 5 copies in the appropriate 

20 background of cellular genomic DNA. For analysis of tumor tissues, a standard 
curve in a background of 10 ng human DNA was generated. For analysis of 
mouse liver and lung tissues, a standard curve using the same adenovirus DNA 
dilutions in a background of 1 JJ9 CD-1 mouse genomic DNA was generated. 
Samples were amplified in triplicate, and the average number of total copies was 

25 normalized to copies per cell based on the input DNA weight amount and a 
genome size of 6 x 10 9 bp. 

The results of the £-galactosidase activity assay and adenoviral hexon 
DNA content for liver transduction by these vectors are shown in Figure 7A and 
7B. The vector containing the K01 or K012 mutations alone showed, on 

30 average, a slight increase in liver transduction compared to Av1 nBg, which is 
consistent with several previous experiments. The vectors containing the PD1 
mutation alone or combined with KOI or K0 12 showed a slight decrease in liver 
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transduction compared to AvInBg, suggesting that integrins are involved to 
some extent in hepatic uptake of the adenoviral vectors. 

The results of the immunohistochemical staining of liver sections for 
^-galactosidase were consistent with the activity assays (data not shown) and 
5 demonstrate that gene expression was localized specifically to hepatocytes. The 
vectors containing the KOI or K012 mutation alone showed a slight increase in 
liver transduction as revealed by a more intense and frequent 
immunohistochemical-staining pattern. The vectors containing the PD1 
mutation, either alone or combined with K01 or K012, showed little difference 

10 in transduction compared to AvInBg. These results demonstrate that 

ablating the viral interaction with CAR and/or integrins is not sufficient to fully 
detarget adenoviral vectors from the liver in vivo. 

In summary, the fiber AB loop mutation contained in AvInBgFKOI or 
Av1nBgK012 ablates interaction with human and mouse CAR in vitro and 

15 diminished transduction in vitro. In vivo t however, fiber AB loop mutations 
behaved unexpectantly, because such mutations were found to enhance 
adenoviral-mediated gene transfer to liver and results in increasing vector 
potency. The penton base, PD1 mutation that ablates interaction with the 
second receptor involved in adenoviral internalization had no effect in vitro and 

20 little to no effect in vivo. These studies indicated that other receptors are 
responsible for adenoviral gene transfer to the liver in vivo. 

EXAMPLE 4 

Description Of Adenoviral Vectors Containing A Fiber With Amino Acid 
Substitutions At The Heparin Sulfate Binding Domain In The Fiber Shaft 

25 Vectors containing substitutions at all four of the amino acids in the four 

amino acid motif in the Ad5 fiber shaft (residues 91 to 94, KKTK; SEQ ID No. 1) 
were generated in order to ablate the potential interaction with HSP. The 
mutation is termed HSP because it potentially eliminates binding to heparan 
sulfate proteoglycans. Vectors containing the HSP mutation alone and combined 

30 with the KOI mutation (fiber knob AB loop mutation that ablates CAR binding), 
the PD1 mutation (penton mutation that eliminates RGD/integrin interaction), and 
a triple knockout vector (HSP, KOI, PD1) were generated. 



WO 03/062400 



9 



PCTAJS03/02295 



-64- 



Generation of the HSP fiber mutation: The HSP mutation was 
incorporated into the fiber gene by using a PCR-based strategy of gene splicing 
by overlap extension (PCR SOEing). First, a segment of the Ad5 genome 
extending from within the E3 region into the 5' end of the fiber gene was 
5 amplified by PCR using the plasmid pSQ1 (Figure 3B) as a template and two 
primers termed 5FF and 5HSPR. The DNA sequence of 5FF is as follows: 5' 
GAA CAG GAG GTG AGC TTA GA 3' (SEQ ID No. 5). This sequence 
corresponds to base pairs 25,199 - 25,218 of P SQ1 . The DNA sequence of 
5HSPR is as follows: 5' GGC TCC GGC TCC GAG AGG TGG GCT CAC AGT 

10 GGT TAC ATT T 3' (SEQ ID No. 1 5). 5HSPR is a reverse primer for 5FF and 
corresponds to a region in the fiber shaft adjacent to the KKT-K (SEQ ID No. 1 ) 
region. The primer contains a 5' extension that encodes a GAGA substitution 
for the native KKTK (encoded by SEQ ID No. 1) amino acid sequence. A second 
PCR using pSQ1 as a template amplified the region immediately 3' of the KKTK 

15 (SEQ ID No. 1) site and extending past the Muni site located 40 base pairs 3' of 
the stop codon for the fiber gene. The two primers used for this reaction were 
3HSPF and 3FR. The DNA sequence of 3HSPF is as follows: 5' GGA GCC GGA 
GCC TCA AAC ATA AAC CTG GAA AT 3' (SEQ ID No. 1 6). It contains a 5' 
extension that is complementary to the 5' extension of 5HSPR. The DNA 

20 sequence of 3FR is as follows: 5' GTG GCA GGT TGA ATA CTA GG 3' (SEQ ID 
No. 8). 

The two PCR products were joined by PCR SOEing using primers 5FF and 
3 PR. The resulting PCR product was digested with the restriction enzymes Xbal 
and Muni. The 2355 bp fragment was gel purified and ligated with the 6477 bp 

25 Xbal to Muni fragment of the plasmid pFBshuttle(EcoRI) (Figure 8) to generate 
the plasmid pFBSEHSP. The plasmid pFBshuttle(EcoRI) was generated by 
digesting the plasmid pSQ1 with EcoRI, then gel purifying and self-ligating the 
8.8 kb fragment containing the fiber gene. Next, the fiber gene containing the 
HSP mutation was transferred from pFBSEHSP into pSQ1 using a three-way 

30 ligation. The 16,431 bp EcoRI to Ndel fragment of pSQ1 , the 9043 bp Ndel to 
Xbal fragment of pSQ1 , and the 7571 bp Xbal to EcoRI fragment of pFBSEHSP 
were isolated and ligated to generate pSQIHSP (Figure 9). 
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To generate a recombinant adenoviral vector containing the HSP mutation 
in the fiber gene, pSQ1 HSP was digested with Clal and pAdmireRSVnBg (Figure 
3A) was digested with Sail and Pad, then the two digested plasmids were 
co-transfected into 633 ceils (von Seggern eta/. (2000) J Virology 74:354-362). 
5 Homologous recombination between the two plasmids generated a full-length 
adenoviral genome capable of replication in 633 cells, which inducibly express 
Ad5 E1 A and constitutively express wild-type fiber protein. After propagation 
on 633 cells, the virus capsid contained wildtype and mutant fiber proteins. To 
obtain viral particles containing only the modified fiber with the HSP mutation, 

10 the viral preparation was used to infect PerC6 cells, which do not express fiber. 
The resulting virus, termed AvinBgFS*, was purified by standard CsCI 
centrifugation procedures. 

Generation of vector containing the HSP and KOI mutations 

To generate an adenoviral vector containing the HSP and KOI mutations 

15 in fiber, a PCR SOEing strategy identical to the one described above was used 
except that the plasmid pSQIFKOI was used as the template. The PCR SOEing 
product was digested with Xbal and Muni and ligated with the 6477 bp Xbal to 
Muni fragment of pFBshuttle(EcoRI) to generate pFBSEHSPKOL The fiber gene 
containing the HSP and KOI mutations was transferred from pFBSEHSPKOI into 

20 the pSQ1 backbone using a three-way ligation strategy identical to the one 
described above for the HSP mutation alone, to generate the plasmid 
pSQIHSPKOI (Figure 10). Recombinant adenoviral vector containing the HSP 
and KOI mutations in the fiber gene was generated by co-transfecting 
pSQIHSPKOI digested with Clal and pAdmireRSVnBg digested with Sail and 

25 Pact into 633 cells. Adenovirus was propagated and purified as described above 
for the vector containing the HSP mutation alone. The resulting virus was 
termed Av 1 nBgFK0 1 S * . 
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Generation of vector containing the HSP and PD1 mutations 
The following strategy was used to generate a recombinant adenoviral 
vector containing the fiber HSP mutation and the penton PD1 mutation. The 
plasmid pSQ1PD1 (Figure 4) was digested with the restriction enzymes Csp45l 
5 and Spel and the 23,976 bp fragment was isolated and purified. In addition, the 
plasmid pSQIHSP was also digested with Csp45l and Spel and the 9090 bp 
fragment was isolated and purified and ligated to the 23,976 bp fragment to 
generate the plasmid pSQ1HSPPD1 (Figure 11), which contains the fiber HSP 
and penton PD1 mutations. An adenoviral vector was generated, propagated, 

10 and purified as described above. The resulting virus was termed Av1nBgS*PD1. 
Generation of vector containing the HSP, KOI, and PD1 mutations 
To generate an adenoviral vector containing the HSP, KOI, and PD1 
mutations the following strategy was used. First, the plasmid pSQ1PD1 was 
digested with Csp45l and Spel and the 23,976 bp fragment was isolated and 

15 purified. In addition, the plasmid pSQIHSPKOI was digested with Csp45l and 
Spel and the 9090 bp fragment was isolated and purified. The two DNA 
fragments were ligated to form the plasmid pSQ1 HSPKOI PD1 {Figure 12). 
Recombinant adenoviral vector was generated, propagated, and purified as 
described above. The resulting virus was termed Av1 nBgFKO! S*PD1 . 

20 EXAMPLES 

In Vitro Evaluation Of Adenoviral Vectors Containing The HSP Fiber Mutation 

The transduction efficiencies of adenoviral vectors containing the HSP 
mutation in the fiber gene, either alone or combined with the KOI and/or PD1 
mutations, were evaluated on A549 and HeLa cells. The transduction 

25 efficiencies were compared to that of AvInBg, an adenoviral vector containing 
wild type fiber and penton. The day prior to infection, cells were seeded into 
24-well plates at a density of approximately 1 x 10 s cells per well. Immediately 
prior to infection, the exact number of cells per well was determined by counting 
a representative well of cells. Each of the vectors, AvInBg (see, Stevenson et 

30 al. (1997) J. ViroL 77:4782-4790), Av1 nBgS*, Av1 nBgFKOI S*, 

Av1nBgS*PD1, and Av1 nBgFKOI S*PD1 , were used to transduce A549 cells at 
each of the following particle per cell (PPC) ratios: 100, 500, 1000, 2500, 
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5000, 10,000. HeLa cells were transduced with each of the above vectors, as 
well as a vector containing the KOI mutation alone (AvInBgFKOD and a vector 
containing the PD1 mutation alone (AvInBgPDD at 2000 PPC. The cell 
monolayers were stained with X-gal 24 hours after infection and the percentage 
5 of cells expressing £-galactosidase was determined by microscopic observation 
and counting of cells. Transductions were done in triplicate and three random 
fields in each well were counted, for a total of nine fields per vector. 

The results (depicted in Figures T3A-13B) showed significantly reduced 
transduction efficiencies on A549 and HeLa cells using vectors containing the 
10 HSP mutation compared to Av1 nBg. The vectors containing the HSP mutations, 
however, demonstrated a dose response on A549 cells, in that increasing PPC 
ratios yielded increasing transduction. 

Competition experiments were done to determine which receptor molecular 
interactions are involved in transduction of A549 cells by the various vectors. 

1 5 Transductions were performed in the presence or absence of various competitors 
including Ad5 fiber knob, a 50 amino acid oligopeptide derived from Adenovirus 
serotype 2 penton base which spans the RGD tripeptide region, or heparin 
(Invitrogen Life Technologies, Gaithersburg, MD). Monolayers of A549 cells 
were cultured in Richters medium supplemented with 10% FBS and were 

20 transduced with AvInBg, AvInBgS*, Av1 nBgFKOI S*, Av1nBgS*PD1, or 

AvI nBgFKOI S*PD1 in infection medium (IM, Richters medium plus 2% FBS). 
Different PPC ratios were used for the different vectors to achieve measurable 
transduction levels. The PPC ratios were as follows: AvInBg: 500 PPC, 
AvInBgS*: 10,000 PPC, Av1 nBgFKOI S*: 20,000 PPC, Av1nBgS*PD1: 10,000 

25 PPC, and Av1 nBgFKOI S*PD1 : 20,000 PPC. Fiber knob competition was 

performed by pre-incubating cells in IM containing 16//g/ml of fiber knob for 10 
minutes at room temperature prior to infection with virus. Penton base peptide 
competition was performed by pre-incubating ceils in IM containing 500nM 
peptide for 10 minutes at room temperature prior to infection with virus. 

30 Heparin competition was performed by pre-incubating each adenoviral vector in 
IM containing 3 mg/ml of heparin for 20 minutes at room temperature. In all 
cases, the competitor remained in the IM during the 1 hour infection when virus 
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was rocked on the cell monolayers at 37° C in 5% C02. After infection, the 
monolayers were washed with PBS, 1 ml of complete medium was added per 
well and the cells were incubated for an additional 24 hours to allow for 
0-galactosidase expression. The cell monolayers were then fixed and stained 
5 with X-Gal. The percentage of cells transduced was determined by light 

microscopy as described above. Each condition was carried out in triplicate and 
three random fields per well were counted, for a total of nine fields per 
condition. The average percentage of transduction per high-power field was 
determined. 

10 The results of the competition experiment (Figure 13C) showed that fiber 

knob inhibited transduction of cells by all vectors except for those that contained 
the KOI mutation. The penton base peptide only inhibited transduction by 
AvInBgFKOIS*. Heparin inhibited transduction by Av1 nBgFKOIS* and 
Av1nBgFK01S*PD1, but did not affect transduction by any of the other viruses 

1 5 suggesting the presence of additional heparin binding sites on the adenoviral 
capsid but that the shaft contains the predominant site. 

EXAMPLE 6 

In Vivo Analysis Of Adenoviral Vectors Containing The HSP Mutation In Fiber 

The objective of this study was to evaluate the in vivo biodistribution of 
20 adenoviral vectors containing the HSP mutation and to determine whether this 
shaft modification influences adenoviral-mediated liver transduction. In addition, 
vectors containing the HSP mutation combined with KOI, or PD1, or a 
combination of all three mutations were evaluated as well as vectors containing 
the KOI mutation alone and the PD1 mutation alone. A positive control cohort 
25 received AvInBg and a negative control group received HBSS. Cohorts of five 
C57BL/6 mice received each vector via tail vein injection at a dose of 1 x 10 13 
particles per kg. The animals were sacrificed approximately 72 hours after vector 
administration by carbon dioxide asphyxiation. Liver, heart, lung, spleen, and 
kidney were collected from each animal. The median lobe of the liver was placed 
30 in neutral buffered formalin to preserve the sample for 0-galactosidase 

immunohistochemistry- In addition, tissue from each organ was frozen to 
preserve it for hexon real time PCR analysis to determine vector content. A 
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separate sample of liver from each mouse was frozen to preserve it for a 
chemiluminescent £-galactosidase activity assay, ^-galactosidase 
immunohistochemistry, hexon real-time PCR and the chemiluminescent 
jff-galactosidase activity assay were carried out as described in Example 3. 
5 The results of the £-galactosidase activity assay (Figure 1 4A) and adenoviral 
hexon DNA content (Figure 1 4B) showed a dramatic reduction in liver 
transduction by vectors containing the HSP mutation. The vectors containing 
the HSP mutation alone resulted in reducing adenoviral-mediated liver gene 
expression by approximately 20-fold.When combined with the KOI mutation 

10 (HSP, KOI, PD1), yielded approximately a 1000-fold reduction in ^-galactosidase 
activity in the liver compared to the control vector AvInBg. The vector 
containing the KOI mutation alone showed a slight increase, on average, in liver 
transduction compared to AvInBg, which is consistent with several previous 
experiments. The vectors containing the PD1 mutation alone or combined with 

15 KOI showed a slight decrease in liver transduction compared to AvInBg, 
although the decrease was not statistically significant. Analysis of hepatic 
adenoviral hexon DNA content (Figure 14B) confirmed these results. 

The results of the immunohistochemical staining of liver sections for 0- 
galactosidase were consistent with the activity assays (data not shown) and 

20 demonstrated that gene expression was localized specifically to hepatocytes. 
Vectors containing the HSP mutation, either alone or in combination with K01 
and/or PD1, showed a dramatic reduction in hepatocyte transduction. The 
vector containing the K01 mutation alone showed a slight increase in liver 
transduction as revealed by a more intense and frequent immunohistochemical 

25 staining pattern. The vectors containing the PD1 mutation, either alone or 
combined with KOI, showed little difference in transduction compared to 
AvInBg. 

EXAMPLE 7 

Description of Adenoviral Vectors Containing the HSP Fiber Shaft Mutation with 
30 and without the KOI Fiber Mutation and with and without a cRGD Targeting 
Ligand in the Fiber Knob HI Loop 
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Generation of vector containing the HSP fiber shaft mutation and a cRGD ligand 
in the HI loop: The following strategy was used to generate an adenoviral vector 
containing a fiber with the HSP shaft mutation and a cRGD ligand in the HI loop. 
The plasmid p5FloxHRFRGD was digested with the restriction enzymes BstXI 
5 and Kpnl and the 1 157 bp fragment was isolated and purified. In addition, the 
fiber shuttle plasmid pFBSEHSP, described in Example 1 above, was digested 
with BstXI and Kpnl and the 4549 bp and 3156 bp fragments were isolated and 
purified. The three fragments were ligated to generate the plasmid 
pFBSEHSPRGD, which encodes a fiber containing the HSP mutation and cRGD in 

10 the HI loop. The fiber gene from this plasmid was transferred into the pSQI 

backbone as follows. The plasmid pFBSEHSPRGD was digested with EcoRI and 
Xbal and the 7601 bp fragment was isolated and purified. The plasmid pSQ1 
(Figure 3B) was digested with the restriction enzymes EcoRI, Ndel, and Xbal and 
the 16,431 bp EcoRI to Ndel fragment and the 9043 bp Ndel to Xbal fragment 

15 were isolated and purified. The three DNA fragments were ligated to generate 
the plasmid pSQIHSPRGD (Figure 15A). 

To generate a recombinant adenoviral vector containing the HSP mutation 
in the fiber gene along with a cRGD ligand in the HI loop, the plasmid 
pSQIHSPRGD was digested with Clal and co-transfected into 633 cells with 

20 pAdmireRSVnBg which had been digested with Sail and Pad. After propagation 
on 633 cells, the virus capsid contained wildtype and mutant fiber proteins. To 
obtain viral particles containing only the modified fiber with the HSP mutation 
and a cRGD ligand, the viral preparation was used to infect PerC6 cells, which 
do not express fiber. The resulting virus, termed Av1nBgS*RGD, was purified 

25 by standard CsCI centrifugation procedures. 

Generation of vector containing the HSP fiber shaft mutation, the KOI fiber knob 
mutation, and a cRGD ligand in the HI loop 

The following strategy was used to generate an adenoviral vector 

containing a fiber with the HSP shaft mutation, the KOI fiber knob mutation, 

30 and a cRGD ligand in the HI loop. The plasmid pSFIoxHRFRGD was digested 

with the restriction enzymes BstXI and Kpnl and the 1 1 57 bp fragment was 

isolated and purified. In addition, the fiber shuttle plasmid pFBSEHSPKOI, 
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described in Example 1 above, was digested with BstXI and Kpnl and the 4549 
bp and 31 56 bp fragments were isolated and purified. The three fragments were 
ligated to generate the plasmid pFBSEHSPKOI RGD, which encodes a fiber 
containing the HSP mutation, the K01 mutation, and cRGD in the HI loop. The 
5 fiber gene from this plasmid was transferred into the pSQ1 backbone as follows. 
The plasmid pFBSEHSPKPOl RGD was digested with EcoRI and Xbal and the 
7601 bp fragment was isolated and purified. The plasmid pSQ1 (Figure 3B) was 
digested with the restriction enzymes EcoRI, Ndel, and Xbal and the 16,431 bp 
EcoRI to Ndel fragment and the 9043 bp Ndel to Xbal fragment were isolated 

10 and purified. The three DNA fragments were ligated to generate the plasmid 
pSQIHSPKOIRGD (Figure 15B). 

To generate a recombinant adenoviral vector containing the HSP and KOI 
mutations in the fiber gene along with a cRGD ligand in the HI loop, the plasmid 
pSQIHSPKOIRGD was digested with Clal and co-transfected into 633 cells with 

15 pAdmireRSVnBg which had been digested with Sail and Pad. After propagation 
on 633 cells, the virus capsid contained wildtype and mutant fiber proteins. To 
obtain viral particles containing only the modified fiber with the HSP and KOI 
mutations and a cRGD ligand, the viral preparation was used to infect PerC6 
cells, which do not express fiber. The resulting virus, termed 

20 Av1nBgFK01S*RGD, was purified by standard CsCI centrifugation procedures. 

EXAMPLE 8 

In Vitro Evaluation of Adenoviral Vectors Containing the HSP Fiber Shaft 
Mutation with or without the Fiber Knob KOI Mutation and with or without a 
cRGD Ligand in the Hi Loop 

25 The transduction efficiencies of adenoviral vectors containing the HSP 

fiber shaft mutation with or without the fiber KOI mutation and with or without 
the cRGD ligand in the HI loop were evaluated on A549 cells. The transduction 
efficiencies were compared to that of AvInBg, an adenoviral vector containing 
wild type fiber. The day prior to infection, cells were seeded into 24-well plates 

30 at a density of approximately 1 x 10 5 cells per well. Immediately prior to 
infection, the exact number of cells per well was determined by counting a 
representative well of cells. Each of the vectors, AvInBg, AvInBgS*, 
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Av1nBgFK01S*, Av1nBgS*RGD, and Av1 nBgFKOI S*RGD, were used to 
transduce A549 cells at a particle to cell ratio of 6250. The cell monolayers 
were stained with X-gal 24 hours after infection and the percentage of cells 
expressing /?-galactosidase was determined by microscopic observation and 
5 counting of cells. Transductions were done in triplicate and three random fields 
in each well were counted, for a total of nine fields per vector. The results 
(Figure 16) showed that the cRGD ligand dramatically increased the transduction 
efficiencies of vectors containing the HSP mutation alone or combined with the 
K01 mutation. AvInBgS* yielded approximately 22% positive cells, while 
10 Av1nBgS*RGD yielded approximately 95% positive cells. Similarly, 

Av1 nBgFKOI S* yielded only 4% positive cells, while Av1 nBgFKOI S*RGD 
yielded 85% positive cells. Therefore, the vector containing the shaft mutation 
is viable and can be retargeted with the addition of a ligand. 

EXAMPLE 9 

15 Construction Of Ad5 Vectors Containing The Ad35 Fiber And Derivatives 
Thereof 

The KOI and HSP mutations in the Ad5 fiber protein (5F), described 
above, were designed to ablate interactions that are responsible for the normal 
tropism of the Ad5 virus. An alternative strategy to detarget the virus is to 

20 replace the Ad5 fiber with a fiber from another serotype which does not bind 

CAR and which does not possess the heparin sulfate proteoglycan (HSP) binding 
domain (KKTK; SEQ ID No. 1 ) within the shaft. The fiber of adenovirus serotype 
35 (35F) does not bind CAR and does not possess the HSP binding domain in its 
shaft. Replacement of the 5F with the 35F can detarget the liver and provide a. 

25 suitable platform for retargeting the vector to the desired tissue. 

Generation of an Ad5 based vector containing the Ad35 fiber: A PCR 
SOEing strategy was used to generate a vector based on the Ad5 serotype but 
containing the Ad35 fiber in place of the Ad5 fiber. First, PCR was used to 
amplify a region in the plasmid pSQ1 between the Xba! site at bp 25,309 and 

30 the start of the fiber gene. The primers used for this reaction were P-0005/U 
and P-0006/L. The DNA sequence of P-0005/U was as follows: 5' C TCT AGA 
AAT GGA CGG AAT TAT TAC AG 3' (SEQ ID No. 1 7). This sequence 
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corresponds to bp 25,308 through 25,334 of pSQ1 . The DNA sequence of 
P-00O6/L was as follows: 5' TCT TGG TCA TCT GCA ACA ACA TGA AGA TAG 
TG 3' (SEQ ID No. 18). It contains a 10 base pair 5' extension that is 
complementary to the start of the Ad35 fiber gene, while the remainder of the 
5 primer anneals to the sequence immediately 5' of the ATG start codon of the 
fiber gene in pSQI . A PCR product of the expected size, 583 bp, was obtained 
and the DNA was gel purified. A second PCR amplified the Ad35 fiber gene 
using DNA extracted from wildtype Ad35 virus as a template. The primers used 
for this reaction were P-0007/U and 35FMun. The DNA sequence of P-0007/U 

10 ; was as follows: 5' GT TGT TGC AG ATG ACC AAG AGA GTC CGG CTC A 3' 
(SEQ ID No. 19). It contains a 10 base pair 5' extension that is homologous to 
the 10 bp immediately prior to the ATG start codon of the fiber gene in Ad5. 
The remainder of the primer anneals to the start of the Ad35 fiber gene. The 
DNA sequence of 35FMun was as follows: 5' AG CAA TTG AAA AAT AAA 

1 5 CAC GTT G AA ACA TAA CAC AAA CG A TTC TTT A GTT GTC GTC TTC TGT 
AAT GTA AGA A 3' (SEQ ID No. 20). It contains a 46 base pair 5' extension 
that is complementary to the region of the Ad5 genome between the end of fiber 
and the Muni site 40 bp downstream of the fiber gene. In addition, the 5' 
extension encodes the last amino acid and stop codon of the Ad5 fiber gene. 

20 This region was retained in the vector because it contains the polyadenylation 
site for the fiber gene. The remainder of the primer anneals to the 3' end of the 
Ad35 fiber gene, up to the next to last amino acid codon. A PCR product of the 
expected size, 1027 bp, was obtained and the DNA was gel purified. The two 
PCR products were mixed and joined together by PCR SOEing using primers 

25 P-0005/U and P-0009. The DNA sequence of P-0009 was as follows: 5' AG 

CAA TTG AAA AAT AAA CAC GTT G 3' (SEQ ID No. 21 ). It corresponds to bp 
27,648 through 27,669 of pSQ1 and overlaps the Muni site in that region. A 
PCR product of the expected size, 1 590 bp, was obtained and gel purified. It 
was cloned into the plasmid pCR4blunt-TOPO (Invitrogen Corporation, Carlsbad 

30 CA) using the Zero Blunt TOPO PCR Cloning Kit from Invitrogen. This 

intermediate cloning step simplified DNA sequencing of the PCR SOEing product. 
The resulting plasmid, termed pTOPOAd35F, was digested with Xbal and Muni 
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and the 1 585 bp digestion product was gel purified and ligated with the 6477 bp 
fragment of pFBshuttle(EcoRI) digested with Xbal and Muni to generate the 
plasmid pFBshuttleAd35F. The Ad35 fiber gene was transferred from 
pFBshuttleAd35F into pSQ1 as follows. The plasmid pSQ1 was digested with 
5 EcoRI and the 24,213 bp fragment was gel purified. The plasmid 

pFBshuttleAd35F was linearized with EcoRI and ligated with the 24,213 bp 
fragment from pSQ1 . Restriction diagnostics were performed to screen for 
constructs containing the Ad35 fiber gene inserted into the pSQ1 backbone in 
the correct orientation. The pSQ1 plasmid containing the Ad35 fiber gene in the 

10 proper orientation was termed pSQ1 Ad35Fiber (Figure 17A). To generate 

adenoviral vector containing the Ad35 fiber, pSQ1 Ad35Fiber was digested with 
Clal and co-transfected into 633 cells with pAdmireRSVnBg which had been 
digested with Sail and Pad. After propagation on 633 cells, the resulting virus 
contained Ad5 fiber and Ad35 fibers on its capsid. The virus was amplified on 

15 PerC6 cells to generate virus containing only the Ad35 fiber on its capsid. The 
resulting virus preparation was termed Av1nBg35F. 

Construction of adenoviral vectors containing chimeric fibers derived from 
Ad5 and Ad35: Two chimeric fiber constructs were prepared by PCR gene 
overlap extension using plasmids containing the full length Ad5 or Ad35 fiber 

20 cDNAs as templates. The Ad5 fiber tail and shaft regions (5TS; amino acids 1 
to 403) were connected with the Ad35 fiber head region (35H; amino acids 137 
to 323) to form the 5TS35H chimera, and the Ad35 fiber tail and shaft regions 
(35TS; amino acids 1 to 136) were connected with the Ad5 fiber head region 
(5H; amino acids 404 to 581) to form the 35TS5H chimera. The fusions were 

25 made at the conserved TLWT sequence at the fiber shaft-head junction. 

For the construction of the 5TS35H chimera, the pFBshuttle(EcoR1 ) 
plasmid was used as the template with primers P1 and P2 to generate the 5' 
fragment. The 3' fragment was generated using the pFBshuttleAd35 plasmid as 
the template with the P3 and P4 primers. The sequence of each primer used in 

30 the construction of these chimeric fibers is listed in Table 2. Amplified PCR 
products of the expected size were obtained and were gel purified. A second 
PCR was carried out with the end primers P1 and P4 to join the two fragments 
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together. The DNA fragment generated in the second PCR was digested with 
Xba1 and Muni and was cloned directly into pFBshuttle(EcoR1 ) to create the 
fiber shuttle plasmid pFBshuttle5TS35H. 

TABLE 2 

5 Primers Used For The Exchange Of Fiber Shaft Regions Between Ad5 And Ad35 

Fibers 



Primer 
designation 


Sequence 


SEQ ID 


P1 


5'-GAACAGGAGGTGAGCTTAGA-3' 


22 


P2 


5'-GTTAGGTGGAGGGTTTATTCCGGTCCAC 
AAAGTTAGCTTATC-3' 


23 


P3 


5'-GATAAGCTAACTTTGTGGACCGGAATAAA 
CCCTCCACCTAAC-3' 


24 


P4 


5'-GTGGCAGGTTGAATACTAGG-3 


25 


P5 


5'-GTTAGGAGATGGAGCTGGTGTAGTCCATA 
AG G TGTT A AT AC- 3 ' 


26 


P6 


5 f -GTATTAACACCTTATGGACTACACCAGCT 
CCATCTCCTAAC-3' 


27 


P7 


5'-TGCGCAAAAACAATCACCACGACAATCACAAT 
GTACATTGGAAGAAATCATACG-3' 


28 


P8 


5'-ACATTGTGATTGTCGTGGTGATT 

Gil I I IGCGCAIAIGCCAIACAAI I I GAATG-3' 


29 



For the construction of the 35TS5H chimera, the pFBshutt!eAd35 plasmid 
was used as the template with the P1 and P5 primers to generate the 5' 
20 fragment. The 3' fragment was generated using the pFBshuttle(EcoR1 ) plasmid 
as the template with the P6 and P4 primers. Following the same procedure 
described above, the fiber shuttle plasmid pFBshuttle35TS5H was generated. 

For the 35TS5H and 5TS35H chimeras, the fiber gene was transferred 
from the pFBshuttle(EcoRI) backbone into pSQ1 as described above for the 
25 vector containing the Ad35 fiber. The resulting plasmids were called 

pSQ135T5H (Figure 18A) and pSQ15T35H (Figure 18B). In addition, adenoviral 
vectors were generated using the co-transfection strategy described above. 

Construction of Ad5 vectors containing the Ad35 fiber with a cRGD 
targeting peptide in the HI loop of the 35F fiber knob: To incorporate the cRGD 
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targeting peptide into the Ad35 fiber HI loop, the P7 and P8 oligonucleotide 
primers encoding the ten amino acid sequence HCDCRGDCFC (SEQ ID No. 30) 
were synthesized. The pFBshuttleAd35 plasmid containing the full length Ad35 
fiber cDNA was used as the template in the PCR reaction with the P1 and P7 
5 primer pair or with the P4 and P8 primer pair in order to generate the 5' and 3' 
PCR fragments. A second PCR was then carried out with the end primers P1 
and P4 to join the two fragments together. The resulting PCR fragment was 
digested with Xba1 and Muni and was cloned into pFBshuttle(EcoR1 ) to create 
the fiber shuttle plasmid pFBshutt!eAd35cRGD. The modified Ad35 fiber gene 
10 was transferred into pSQ1 using the EcoRl cloning strategy described above to 
generate pSQ1 Ad35FcRGD (Figure 17B). Adenoviral vector was generated 
using the co-transfection strategy described above. 

EXAMPLE 10 

In Vitro Evaluation Of Adenoviral Vectors Containing 35F And Derivatives 
1 5 Thereof 

The transduction efficiencies of adenoviral vectors containing the 35F or 
derivatives thereof were evaluated on A549 cells. The transduction efficiencies 
were compared to that of AvInBg, an adenoviral vector containing the 5F fiber. 
The day prior to infection, cells were seeded into 24-well plates at a density of 

20 approximately 1 x 10 5 cells per well. Immediately prior to infection, the exact 
number of cells per well was determined by counting a representative well of 
cells. Each of the vectors, AvInBg, Av1 nBg35F, Av1nBg5T35H and 
Av1nBg35T5H were used to transduce A549 cells from 0 up to 1,000 particle 
per cell (PPC) ratios. The cell monolayers were stained with X-gal 24 hours after 

25 infection and the percentage of cells expressing ^-galactosidase was determined 
by microscopic observation and counting of cells. Transductions were done in 
triplicate and three random fields in each well were counted, for a total of nine 
fields per vector. The results (Figure 1 9) showed similar transduction 
efficiencies on A549 cells using the Av1nBg35F and Av1nBg5T35H vectors 

30 compared to AvInBg. The Av1nBg35T5H showed much lower transduction 
efficiencies on A549 cells compared to AvInBg as a result of the Ad35 shaft 
domain. The Ad35 shaft domain does not contain a HSP binding motif and the 
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Av1nBg35T5H vector behaves similarly to the AvInBgS* vector in vitro and in 
vivo. These studies also demonstrate that vectors containing fiber proteins 
without an HSP binding site are fully viable. 

EXAMPLE 1 1 

5 in Vivo Evaluation Of Adenoviral Vectors Containing 35F And Derivatives 
Thereof 

The objective of this study was to evaluate the in vivo biodistribution of 
adenoviral vectors containing 35F fibers and derivatives thereof to determine 
whether vectors containing these fibers ablate liver transduction due to their 

10 shaft regions. A positive control cohort received AvInBg and a negative control 
group received HBSS. Cohorts of five C57BL/6 mice received each vector via 
tail vein injection at a dose of 1 x 10 13 particles per kg. The animals were 
sacrificed approximately 72 hours after vector administration by carbon dioxide 
asphyxiation. Liver, heart, lung, spleen, and kidney were collected from each 

15 animal. The median lobe of the liver was placed in neutral buffered formalin to 
preserve the sample for jff-galactosidase immunohistochemistry- In addition, 
tissue from each organ was frozen to preserve it for hexon PCR analysis to 
determine vector content. A separate sample of liver from each mouse was 
frozen to preserve it for a chemiluminescent /?-galactosidase activity assay. /?- 

20 galactosidase immunohistochemistry, hexon real-time PCR and the 

chemiluminescent >?-galactosidase activity assay were carried out as described in 
example 3. 

The results of the ^-galactosidase activity assay showed a dramatic 
reduction in liver transduction by vectors containing the Ad35 fiber or the 
25 35T5H derivative (Figure 20) with an approximately 4- to 24-fold reduction in jff- 
galactosidase activity in the liver compared to the control vector AvInBg. These 
data demonstrate that shaft domains without HSP binding sites can effectively 
ablate hepatic in vivo gene transfer. In particular, HSP is the major entry 
mechanism for liver in vivo. CAR binding is a minor entry pathway. 
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EXAMPLE 12 

Construction Of Ad5 Vectors Containing The Ad Serotype 41 Short Fiber And 
Derivatives Thereof 

The human adenovirus serotype 41 contains two different fibers on its 

5 capsid, encoded by two adjacent genes. One fiber has a molecular weight of 

60kDa and is approximately 315A in length and is termed the long fiber. The 

other fiber has a molecular weight of 40kDa and is approximately 250-f- in 

length and is termed the short fiber. The Ad41 short fiber does not bind CAR 

and does not possess the heparin binding domain (KKTK) in its shaft. Therefore, 

10 this fiber provides a useful platform for adenoviral vector targeting. 

Construction of adenoviral vectors based on Ad5 but containing the Ad41 
short fiber: A PCR SOEing strategy was used to generate a vector based on the 
Ad5 genome but containing the Ad41 short (Ad41s) fiber. First, PCR was used 
to amplify the region of pSQ1 between the Xbal site at bp 25,309 and the start 

15 of the fiber gene. The primer pair used for the PCR were P-0005/U and 

P-0010/L. The DNA sequence of P-0005/U was as follows: 5' C TCT AGA AAT 
GGA CGG AAT TAT TAC AG 3' (SEQ ID No. 17). The sequence corresponds to 
bp 25,308 through 25,334 of pSQ1 and overlaps the Xbal site in that region. 
The DNA sequence of P-0010/L was as follows: 5' TTC TTT TCA T CTG CAA 

20 CAA CAT GAA GAT AGT G 3' (SEQ ID No. 31). It contains a 5' extension 

corresponding to the first 10 bp of the Ad41s fiber gene. The remainder of the 
primer anneals to pSQ1 immediately 5' of the ATG start codon of the fiber gene. 
The PCR product was the expected size (583 bp). A second PCR was used to 
amplify the Ad41s fiber using the plasmid pDV60Ad41sF as a template. The 

25 primers used were P-001 1/U and P-001 2/L. The DNA sequence of P-001 1/U 
was as follows: 5' GT TGT TGC AG ATG AAA AGA ACC AGA ATT GAA G 3' 
(SEQ ID No. 32). It contains a 10 bp 5' extension corresponding to the DNA 
sequence immediately 5' of the ATG start codon of the fiber gene in pSQ1 . The 
remainder of the primer anneals to the beginning of the Ad41s fiber gene in 

30 pDV60Ad41sF. The DNA sequence of P-001 2/L was as follows: 5' TG CAA 
TTG AAA AAT AAA CAC GTT GAA ACA TAA CAC AAA CGA TTC TTT ATT C 
TTC AGT TAT GTA GCA AAA TAC A 3' (SEQ ID No. 33). It contains a 51 bp 
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5' extension corresponding to the sequence in pSQ1 from the last codon of the 
fiber gene through the Muni site 40 bp downstream of the fiber gene. The 
remainder of the primer anneals to the 3' end of the Ad41s fiber gene in 
pDV60Ad41sF. The PCR product was the expected size (1219 bp). The two 
5 PCR products were joined by PCR SOEing using primers P-0005/U and P-0009/L. 
The DNA sequence of P-0009/L was described above. The PCR SOEing reaction 
yielded the expected 1 782 bp product. The product was cloned into 
pCR4blunt-TOPO to yield pCR4blunt-TOPOAd41 sF. Next, 
pCR4blunt-TOPOAd41sF was digested with Xbal and Muni and the 1773 bp 
10 fragment containing the Ad41s fiber gene was gel purified. This fragment was 
ligated with the 6477 bp Xbal to Muni fragment of pFBshuttle(EcoRI) to generate 
pFBshuttleAd41 sF. The Ad41s fiber gene was transferred into the pSQ1 
backbone as follows. First, pFBshuttleAd41 sF was linearized using EcoRI and 
this fragment was ligated with the 24,213 bp EcoRI fragment of pSQ1 to 
15 generate pSQ1Ad41sF (Figure 21 A). Adenoviral vector containing the Ad41s 
fiber was generated using the co-transfection strategy described above. 

Construction of Ad5 adenoviral vectors containing the Ad41 short fiber 
with a cRGD targeting ligand in the HI loop: A PCR SOEing strategy was used to 
generate a construct containing the Ad41s fiber with cRGD in the HI loop. The 
20 plasmid pFBshuttleAd41 sF was used as a template for the PCR amplifications. 
First, a 1782 bp fragment was amplified using primers 5FF and 41sRGDR. The 
primer 5FF was described above. It anneals to pFBshuttleAd41sF at the Xbal 
site upstream of the fiber gene. The DNA sequence of the primer 41sRGDR was 
as follows: 5' AGT ACA AAA ACA ATC ACC ACG ACA ATC ACA GTT TAT 
25 CTC GTT GTA GAC GAC ACT GA 3' SEQ ID No. 34). It contains a 30 bp 5' 

extension that encodes the cRGD targeting ligand. The remainder of the primer 
anneals to pFBshuttleAd41 sF from bp 2878 through 2903. A second PCR 
amplified a 277bp region of pFBshuttleAd41 sF using primers 3FR and 41sRGDF. 
The primer 3FR was described previously. It anneals to pFBshuttleAd41 sF at the 
30 Muni site downstream of the fiber gene. The DNA sequence of 41sRGDF was 
as follows: 5' TGT GAT TGT CGT GGT GAT TGT TTT TGT ACT AGT GGG TAT 
GCT TTT ACT TTT 3' (SEQ ID No. 35). It contains a 30 bp 5' extension that 
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encodes the cRGD targeting ligand and is complementary to the extension on 
41 sRGDR. The remainder of the primer anneals to pFBshuttleAd41 sF from bp 
2904 through 2924. The two PCR products were joined by PCR SOEing to 
generate a 2059 bp fragment using primers 5FF and 3FR. The product was 
5 digested with Xbal and Muhl and the 1803 bp DNA fragment was gel purified. 
The fragment was ligated with the 6477 bp fragment resulting from digestion of 
pFBshuttle(EcoRI) with Xbal and Muni. The resulting plasmid was termed 
pFBshuttleAd41 sRGD. This plasmid was linearized by EcoRI digestion and 
ligated with the 24,21 3bp EcoRI fragment of pSQ1 to generate pSQ1 Ad41sRGD 
10 (Figure 21 B). 

EXAMPLE 13 

In Vivo Evaluation Of Ad5 Vectors Containing The Ad41 Short Fiber And 
Derivatives Thereof 

This example evaluates the in vivo biodistribution of adenoviral vectors 

15 containing 41 sF fibers and derivatives thereof to determine whether vectors 
containing the these fibers ablate liver transduction due to modified shaft 
regions. A positive control cohort received Av3nBg (see, Gorziglia et al. (1996) 
J. Virology 70:4173-4178) or Ad5./?Gal.AF/5F, and a negative control group 
received HBSS. Ad5.£Gal.AF/5F is a derivative of the fiberless vector 

20 Ad5.£gal.AF (ATCC accession number VR2636) modified to express AD5 fiber 
(see, e.g., International PCT application No. WO01 83729). 

The Ad5./?Gal.AF vector was pseudotyped with the Ad41sF fiber protein 
and injected in vivo. Cohorts of five C57BL/6 mice received each vector via tail 
vein injection at a dose of 1 x 10 13 particles per kg. The animals were sacrificed 

25 approximately 72 hours after vector administration by carbon dioxide 

asphyxiation. Liver, heart, lung, spleen, and kidney were collected from each 
animal. The median lobe of the liver was placed in neutral buffered formalin to 
preserve the sample for £-galactosidase immunohistochemistry. In addition, 
tissue from each organ was frozen to preserve it for hexon PCR analysis to 

30 determine vector content. A separate sample of liver from each mouse was 
frozen to preserve it for a chemiluminescent £-galactosidase activity assay. 0- 
galactosidase immunohistochemistry, hexon real-time PCR and the 



WO 03/062400 PCT/US03/02295 



-81- 

chemiluminescent /?-galactosidase activity assay was carried out as described in 
example 3. 

The results of the hexon DNA analysis showed a dramatic reduction in 
liver transduction by vectors containing the Ad41sF fiber (Figure 22) with an 
5 approximately a 5-fold reduction in liver adenoviral DNA content compared to 
either control vector. 

In the above examples, several novel adenoviral vectors were generated 
containing various fiber modifications designed to ablate the normal tropism of 
the vector. See Table 3. Vectors were generated in which the heparan sulfate 
10 binding domain in the fiber shaft was replaced by amino acid substitutions. This 
mutation, termed HSP, was also combined with the K01 mutation (fiber knob AB 
loop mutation that ablates CAR binding), and the PD1 mutation (penton mutation 
that eliminates RGD/integrin interaction). In addition, a vector containing all three 
mutations (HSP, KOI, PD1) was generated. All vectors containing the HSP 

15 mutation, either alone or combined with other capsid modifications, showed 
dramatically reduced transduction efficiencies on A549 and HeLa cells. 
Furthermore, the same vectors showed dramatically reduced transduction of the 
liver following systemic delivery to mice. As an alternative strategy to ablate the 
normal tropism of Ad5-based vectors, the Ad5 fiber was replaced by a fiber from 

20 a different adenovirus serotype which does not bind CAR and does not contain 
the heparan binding domain in the shaft. Thus, vectors were generated 
containing the Ad35 fiber and the Ad41 short fiber. Versions of these two 
vectors containing a cRGD targeting ligand in the HI loop of the fiber were also 
produced. Additionally, vectors containing chimeric fibers were generated. A 

25 vector containing the Ad35 fiber tail and shaft regions fused to the Ad5 fiber 
knob domain as well as a vector containing the Ad5 fiber tail and shaft fused to 
the Ad35 fiber knob domain were constructed. Vectors containing either the 
entire Ad35 or Ad41 short fiber showed a significant reduction in liver 
transduction following delivery to mice via the tail vein. The observation of 

30 reduced liver transduction using vectors containing either an HSP mutation, the 
Ad35 fiber, or the Ad41 short fiber indicates the feasibility of detargeting 
adenoviral vectors in vivo. In vitro data with the Ad35 fiber or the Ad41 short 
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fiber with cRGD (see Example 14} indicate that the virus is completely viable, 

that is, it is not damaged by the absence of an HSP binding site and is 

retargetable. Taken together these data suggest that these vectors provide a 

suitable platform' for retargeting strategies. 

5 TABLE 3 

Description Of Recombinant Adenoviral Vectors Used 
To Demonstrate That Shaft Modifications Influence Tropism In Vivo 

Vector 



Vector 


Description 


AvInBg 


An E1 and E3-deleted adenoviral vector encoding a nuclear 
localizing j£?-galactosidase 


Ad 5 Fiber derivatives: 


AvInBgFKOI 


The same as AvInBg but containing the KOI AB loop 
mutation in the fiber gene 


Av1nBgPD1 


The same as AvInBg but containing the penton PD1 mutation 
that deletes the integrin binding, RGD tripeptide 


AvInBgS* 


The same as Av1 nBg but containing the 4 amino acid 
substitution in the shaft referred to as S* that modifies the 
HSP binding motif 


AvInBgFKOI S* 


The same as AvInBg but containing the fiber K01 and S* 
mutations combined 


Av1nBgS*PD1 


The same as AvInBg but containing the fiber S* and penton 
PD1 mutations combined 


AvlnBgFK01S*PD1 


The same as AvInBg but containing the fiber KOT, S* and 
penton PD1 mutations combined 


Ad35 fiber derivatives: 


Av1nBg35F 


The same as Av1 nBg but containing the full length Ad35 fiber 
cDNA 


AvInBg 5T35H 


The same as Av1 nBg but containing the 5T35H chimeric fiber 


Av1nBg35T5H 


The same as Av1 nBg but containing the 35T5H chimeric fiber 


Av1nBg35FRGD 


The same as Av1 nBg but containing the full length Ad35 fiber 
cDNA with a cRGD ligand in the HI loop of the Ad35 fiber 


Ad41sF fiber derivatives: 


Av1nBg41sF 


The same as AvInBg but containing the full length Ad41 short 
fiber cDNA 
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Vector 


Description 


Av1nBg41sFRGD 


The same as AvInBg but containing the full length Ad41 short 
fiber cDNA with a cRGD ligand in the HI loop of the Ad41 
short fiber 



EXAMPLE 14 

In Vitro Evaluation Of Adenoviral Vectors Containing The Ad41sF With A cRGD 
5 Ligand In The HI Loop 

The transduction efficiencies of adenoviral vectors containing the Ad41 sF 

fiber with the cRGD ligand in the HI loop were evaluated on A549 cells. The 

transduction efficiencies were compared to that of Av1 nBg, an adenoviral vector 

containing wild type fiber or Av1 nBgFKOI RGD, an adenoviral vector containing 

10 the K01 mutation in combination with the cRGD ligand in the HI loop. The day 
prior to infection, cells were seeded into 24-well plates at a density of 
approximately 1 x 10 5 cells per well. Immediately prior to infection, the exact 
number of cells per well was determined by counting a representative well of 
cells. Each of the vectors, AvInBg, Av1 nBgFKOI RGD, and Av1 nBg41 sFRGD 

15 were used to transduce A549 cells at a particle to cell ratios of O up to 10,000. 
The cell monolayers were stained with X-gal 24 hours after infection and the 
percentage of cells expressing yff-galactosidase was determined by microscopic 
observation and counting of cells. Transductions were done in triplicate and 
three random fields in each well were counted, for a total of nine fields per 

20 vector. The results (Figure 23) show that the Av1 nBg41 sFRGD vector 

transduced cells to an equivalent level as Av1 nBgFKOI RGD at all vector doses 
examined. Neither FKOl or Ad41sF can bind CAR. The Ad41sF does not 
normally interact with CAR and additionally does not contain the HSP binding 
motif within the shaft domain. These data show that targeting peptides inserted 

25 into the loop regions of the fiber knob of K01 and Ad41sF allows for 

transduction of target cells via the targeted receptor. Surprisingly, HSP, not 
CAR and integrins, is the major entry route in vivo and ablation of HSP binding 
permits targeting of adenoviral vectors. 
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EXAMPLE 15 

Effect of the shaft modification on the biodistribution of adenoviral vectors in 
vivo 

The influence of fiber and penton modifications on the in vivo 
5 biodistribution of adenoviral vectors containing fiber head, shaft and penton 
mutations was examined. Vectors containing the HSP mutation combined with 
K01, or PD1, or a combination of all three mutations were evaluated as well as 
vectors containing the K01 mutation alone and the PD1 mutation alone. The 
indicated adenoviral vectors were systemically administered to C57BL6 mice as 

10 described above. A positive control cohort received AvInBg and a negative 
control group received HBSS. Cohorts of five C57BL/6 mice received each 
vector via tail vein injection at a dose of 1 x 10 13 particles per kg. The animals 
were sacrificed approximately 72 hours after vector administration by carbon 
dioxide asphyxiation. Liver, heart, lung, spleen, and kidney were collected from 

15 each animal. Tissue from each organ was frozen to preserve it for real time PGR 
analysis to determine adenoviral hexon DNA content. A separate sample of liver 
from each mouse was frozen to preserve it for a chemiluminescent /?- 
galactosidase activity assay. Hexon real-time PCR and the chemiluminescent fi- 
galactosidase activity assay was carried out as described in Example 3. 

20 The results derived from the liver are described in Example 6 (Figure 14A 

and B) and also shown in Figure 26 with results presented as percent control of 
AvInBg. The effect of the S* shaft modification on the biodistribution of 
adenovirus to the other organs is shown in Figure 25. The average adenoviral 
DNA content was determined as adenoviral genomic copies per cell and 

25 expressed as a percentage of the AvInBg ( + ) control value. The average 

percent control value + standard deviation is shown (n = 5 per group) for each 
tissue examined (Figure 25). Systemic delivery of Ad5 based vectors with 
wild-type fiber results in a preferential accumulation of vector DNA in the liver 
with 64 copies per cell with significantly less DNA found in the other organs 

30 with 1.32 copies per cell found in lung, 2.18 copies per cell in spleen, 0.47 
copies per cell found in heart, and 0.72 copies per cell in the kidney. All 
differences found with PD1, S*, K01PD1, KOIS*, S*PD1, and KO!S*PD1 were 
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significantly different than the AvInBg ( + ) control using a unpaired, t-test 
analysis, P value ( 0.024. When expressed as a percent of the AvInBg control 
values, the influence of each mutation, individually or in combination, becomes 
apparent. The S* mutation dramatically reduced gene transfer to all four organs, 
5 whereas, the KOI mutation did not. Thus, the importance of the shaft for 

transduction in vivo extends to organs besides the liver. Finally, gene transfer to 
the lung, heart, and kidney was diminished with PD1 suggesting a role for 
integrin binding in vector entry in these organs. 

EXAMPLE 16 

10 Retargeting the S*, shaft modification and the 41 sF fiber in vivo 

Vectors containing the HSP mutation have been shown to effectively 
detarget adenoviral vectors in vivo (see examples 6 and 15). The objective of 
this study was to evaluate the ability to retarget vectors containing the S* 
modification or the Ad41sF to tumors in vivo, A cRGD peptide was genetically 

15 incorporated into the fiber HI loop and evaluated in vitro (Examples 8 and 14). 
These same vectors were then evaluated in vivo in tumor-bearing mice. Athymic 
nu/nu female mice were injected with 8 x 10 6 A549 cells on the right hind flank. 
When tumors reached approximately 100mm3 in size, they were randomized 
into treatment groups. Cohorts of 6 mice received each vector via tail vein 

20 injection at a dose of 1 x 10 13 particles per kg. The animals were sacrificed 
approximately 72 hours after vector administration by carbon dioxide 
asphyxiation. Tumor, liver, heart, lung, spleen, and kidney were collected from 
each animal. Tissue from each organ was frozen to preserve it for real time PCR 
analysis to determine adenoviral hexon DNA content. Hexon real-time PCR was 

25 carried out as described in example 3. A separate sample of liver from each 

mouse was frozen to preserve it for a chemiluminescent /?-galactosidase activity 
assay. Hexon real-time PCR and the chemiluminescent /?-galactosidase activity 
assay was carried out as described in example 3. 

The adenoviral vector biodistribution to the liver and tumor for each 

30 treatment group is shown in Figure 27. Vectors containing the S*, KOIS*, and 
41 sF fibers effectively detargeted the liver and tumor resulting in a significant 
reduction in the amount of adenoviral DNA found in each tissue in comparison to 
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the Av1 nBg control. Vectors containing the cRGD targeting ligand restored 
tranduction of the tumors to levels comparable to that achieved with the 
untargeted vector. 

These data demonstrate successful liver detargeting accompanied with 
5 tumor retargeting. The extent of tumor retargeting is relates to the affinity and 
type of ligand that is used. These data demonstrate the successful 
development of a targeted, systemically deliverable adenoviral vector that will 
target tumors in vivo. 

EXAMPLE 17 

10 Scale-Up Method For The Propagation Of Detargeted Adenoviral Vectors 

The growth and propagation of doubly or triply ablated adenoviral vectors 
requires novel scale up technologies. These detargeted vectors require 
alternative cellular entry strategies to allow for the efficient growth and 
generation of high titer preparations. A strategy for vector growth that is 

15 generally applicable to all detargeted adenoviral vectors, that does not require 
the development of new cell lines, and that aslo can be used for generating 
targeted vectors is provided herein. 

Three recombinant adenoviral vectors were prepared that contain single 
mutations in the fiber or penton or both mutations combined into one vector. 

20 These vectors are designated Av3nBgFKO! , Av1nBgPD1, and Av1 nBgFKOI PD1 , 
respectively. The construction of these vectors is described above and a general 
description of each vector can be found in Table 1 above. 

Scale-up of detargeted adenoviral vectors: A polycation, specifically 
hexadimethrine bromide was obtained from Sigma Chemical Co (St. Louis, MO), 

25 Catalog No. 52495, and was maintained in the medium at 4//g/ml during the 
course of transfections and infections. To illustrate the affects of 
hexadimethrine bromide on the yield of detargeted adenoviral vectors the 
following experiment was carried out. Seven plates of AE1-2a adenoviral 
producer cells (Gorziglia et al. (1 996) J. Virology 70:4173-4178) were 

30 transduced with 10 particles per cells of each of the indicated vectors (See Table 
4). Each vector was incubated with medium (Richters with 2% HI-FBS) 
containing hexadimethrine bromide at 4 //g/ml for 30 min at room temperature 



WO 03/062400 PCT/US03/02295 



-87- 

prior to infection. The infection was carried out for 2 hrs. Complete medium 
containing hexadimethrine bromide at 4//g/ml was added to each plate. Final 
concentration of hexadimethrine bromide in all of these experiments was 
maintained at 4 //g/ml. The titers were determined spectrophotometrically using 
5 the conversion of 10D at A260nm per 1 x 10 12 particles (Mittereder etaL 

(1996) J Virology 70:7498-7509). The total particle yield was then normalized 
for the number of plates used for transduction. 

The inclusion of hexadimethrine bromide in the medium during the course 
of infection allows for the efficient propagation of detargeted adenoviral vectors 

10 containing fiber and penton mutations either alone or in combination. The affect 
of hexadimethrine bromide on vector yields is shown in Table 4. A 35-fold 
improvement in the yield of Av3nBgFK01 was found when hexadimethrine 
bromide was included in the culture medium and resulted in increasing the vector 
yield from 1 .3 x 10 10 up to 4.6 x 10 11 vector particle per plate. Hexadimethrine 

15 bromide has a minimal effect on the yield of the Av1nBgPD1 adenoviral vector 
containing the penton, PD1 mutation with only a 1 .2 fold improvement. The 
greatest effect using hexadimethrine bromide was found on the propagation of 
the doubly ablated adenoviral vector, Av1 nBgFKOI PD1 with increases in vector 
yield from barely detectable levels up to 4.53 x 10 10 vector particles per plate. 

20 These data demonstrate that use of nonspecific entry mechanisms allows for the 
efficient scale-up of detargeted adenoviral vectors. 

TABLE 4 



Efficient Scale-Up Of Detargeted Adenoviral Vectors Using hexadimethrine 

bromide 



Vector 


Vector Yield (particles/plate) 


Fold 
Improvement 


{-) hexadimethrine 
bromide 


{ + ) hexadimethrine 
bromide 


AvInBg 


3.89 x 10 11 


5.72 x 10 11 


1.47 


Av3nBg 


8.58 x 10 10 


2.38 x 10 11 


2.77 


Av3nBgFK01 


1.30 x 10 10 


4.60 x 10 11 


35.4 


Av1nBgPD1 


1.95 x 10 11 


2.40 x 10 11 


1.23 


Av1 nBgFKOI PD1 


TLTC* 


4.53 x 10 10 


T 
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10 



15 



20 



25 



*TLTC: Too low to count, a faint virus band was collected and the particle 
concentration was too dilute for titer determination. 
* Significant improvement 

The use of alternative polycations including protamine sulfate and 
poly-lysine as well as bifunctional proteins such as the anti-penton:TNFa fusion 
protein was investigated. Figure 24 show results that demonstrate all the 
reagents tested had some effect on enhancing transduction of the Av3nBgFK01 
vector. All of these compounds, when maintained in the medium during 
infection, enhanced transduction of the Av3nBgFK01 detargeted adenoviral 
vector. 

Bifunctional reagents:. The use of bifunctional reagents for the 
propagation of detargeted adenoviral vectors was examined using the 
anti-penton:TNFa fusion protein. This particular reagent is a fusion protein 
between an antibody against Ad5 penton and the TNFa protein that is produced 
using stably transfected insect cells. This reagent will bind specifically to the 
adenoviral capsid via penton base and allow for binding to cell surface TNF 
receptors. The use of this reagent for the propagation of detargeted vectors is 
illustrated in Table 5 using Av3nBgFK01 (also shown in Figure 24). Monolayers 
of S8 cells were infected with 10 or 100 particles per cell of Av3nBgFK01 or a 
control vector in the presence or absence of 1 /yg/ml of the anti-penton:TNFa 
fusion protein. The monolayers were visually inspected over time for vector 
spread as indicated by the extent of cytopathic effect (CPE). The percentage of 
CPE at each time point is shown. The use of this bifunctional reagent clearly 
enhances the spread of the Av3nBgFK01 vector throughout the monolayer. 

TABLE 5 

Efficient Scale-Up Of Detargeted Adenoviral 
Vectors Using Bifunctional Reagents: Anti-Penton:TNFo 





10 ppc - 
anti-penton 
TNF 


10 ppc + 
anti-penton 
TNF 


100 ppc - 
anti-penton TNF 


100 ppc + 
anti-penton TNF 




Percentage of CPE 


AdBLud 






24 h 


0% 


0% 


0% 


0% 



30 
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10 ppc - 
anti-penton 
TNF 


10 ppc + 
anti-penton 
TNF 


100 ppc - 
anti-penton TNF 


100 ppc + 
anti-penton TNF 




Percentage of CPE 


48 h 


20-30% 


20-30% 


90-100% 


90-100% 


72 h 


60-70% 


80-90% 


100% 


100% 


120 h 


100% 


100% 


100% 


100% 


Av3nBgK01 24hrs 


24 h 


0% 


0% 


0% 


0% 


48 h 


0% 


10-20% 


0% 


90-100% 


72 h 


5% 


60-70% 


5% 


100% 


120 h 


40-50% 


100% 


100% 


100% 



10 

Since modifications will be apparent to those of skill in this art, it is 
intended that this invention be limited only by the scope of the appended claims. 
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WHAT IS CLAIMED IS: 

1 . A modified adenovirus capsid protein, 

the unmodified capsid protein binds to heparin sulfate 
proteoglycan (HSP); and 
5 the capsid protein comprises a mutation, whereby binding to 

heparin sulfate proteoglycan (HSP) is altered. 

2. The modified protein of claim 1 that is a fiber protein 

3. The capsid protein of claim 2, wherein the binding of the modified 
fiber protein is eliminated or reduced compared to the unmodified protein. 

TO 4. The modified protein of claim 2, wherein the binding of the 

modified fiber protein is eliminated or reduced compared to the unmodified 
protein. 

5. The modified protein of claim 3 that comprises an insertion, 
deletion or replacement of amino acids. 
15 6. The modified protein of claim 2, wherein the mutation alters the 

motif that binds to HSP, whereby HSP interaction is altered. 

7. The modified protein of claim 6, motif is BBXB or BBBXXB, 
wherein the B is a basic amino acid and X is any amino acid. 

8. The modified protein of claim 7, wherein the motif comprises the 
20 consensus sequence KKTK. 

9. The modified protein of claim 2, wherein the fiber is a modified 
Ad5 or Ad2 fiber. 

10. A modified protein of claim 2 that is a chimeric fiber protein, 
comprising portions of fiber proteins from at least two different adenoviruses, 

25 wherein: 

a shaft or portion thereof is from a first adenovirus, whereby the resulting 
fiber does not bind to HSP or binds to HSP with reduced affinity compared to an 
unmodified fiber protein; 

a shaft or portion thereof from the first adenovirus does not bind 
30 to HSP or binds to HSP with reduced affinity compared to the second 
adenovirus; 

the second adenovirus binds to HSP; and 
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the portion comprises a sufficient portion to alter HSP binding of 
the resulting protein. 

1 1 . The modified protein of claim 10, wherein the binding to HSP of 
the modified fiber protein is eliminated or reduced compared to the unmodified 
5 protein. 

1 2. The modified protein of claim 10, wherein the remainder of the fiber 
protein is from the second adenovirus. 

13. The modified protein of any of claims 2, 3, 10 and 1 1, further 
comprising one or more further modifications that reduce or eliminate interaction 

10 of the resulting fiber with one or more cell surface proteins in addition to HSP. 

14. The modified protein of claim 13, further comprising a ligand, 
whereby the resulting fiber binds to a receptor for the ligand. 

15. The modified protein of claim 14, wherein the ligand is included in 
the knob region. 

15 16. The modified protein of claim 14, wherein the ligand is inserted or 

it replaces a portion of the fiber, whereby the resulting fiber binds to a receptor 
for the ligand. 

17. A modified protein of claim 1 1, wherein affinity for HSP is reduced 
at least by an amount selected from among reduced 5-fold, 10-fold and 100- 

20 fold. 

18. The modified protein of claim 1 1 , wherein the first adenovirus is 
selected from the group consisting of subgroup B, D or F, and the second is of 
subgroup C. 

19. The modified protein of claim 10, wherein the first adenovirus is 
25 selected from the group consisting of Ad3, Ad35, Ad7, Ad1 1, Ad16, Ad21, 

Ad34, Ad40, Ad41 and Ad46. 

20. The modified protein of claim 18, wherein the second adenovirus 
is Ad5 or Ad2. 

21. The modified protein of claim 19, wherein the second adenovirus 
30 is Ad5 or Ad2. 

22. A modified protein of claim 1 selected from the group consisting of 
a fiber protein comprising: 
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the sequence of amino acids set forth in any of SEQ ID Nos. 52, 54, 56, 
58, 62, 66, 70 and 72; or 

a sequence of amino acids having 90% sequence identity with a 
sequence of amino acids set forth in any of SEQ ID Nos. 52, 54, 56, 58, 62, 66, 
5 70 and 72; or 

a sequence of amino acids encoded by a sequence of nucleotides that 
hybridizes under conditions of high stringency along at least 70% of its length to 
a sequence of nucleotides that encodes a sequence of amino acids set forth in 
any of SEQ ID Nos. 52, 54, 56, 58, 62, 66, 70 and 72. 
10 23. A nucleic acid molecule encoding a modified protein of any of 

claims 1-12 and 14-22. 

24. A nucleic acid molecule encoding a modified protein of claim 13. 

25. The nucleic acid molecule of claim 23 that comprises a vector. 

26. The nucleic acid molecule of claim 24 that comprises vector. 

15 27. The nucleic acid molecule of claim 25 that is an adenovirus vector. 

28. The nucleic acid molecule of claim 26 that is an adenovirus vector. 

29. The vector of claim 27 that is an adenoviral vector from a 
subgroup B, C or D adenovirus. 

30. The vector. of claim 28 that is an adenoviral vector from a 
20 subgroup B, C or D adenovirus. 



25 



35. 



34. 



32. 



33. 



31. 



A cell, comprising a nucleic acid molecule of claim 23. 

A cell, comprising a nucleic acid molecule of claim 24. 

The cell of claim 31 that is a eukaryotic cell. 

The cell of claim 32 that is a eukaryotic cell. 

A cell, comprising a nucleic acid molecule of claim 27, wherein: 

the cell is a eukaryotic cell; and 

the cell in a packaging cell. 



36. 



A cell, comprising a nucleic acid molecule of claim 28, wherein: 
the cell is a eukaryotic cell; and 
the cell in a packaging cell. 



30 
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37. An adenoviral particle, comprising a modified protein of any of 
claims 1-12 and 14-22, whereby binding of the viral particle to HSP is altered 
compared to a particle that expresses an unmodified fiber. 

38. An adenoviral particle, comprising a modified protein of claim 13, 

5 whereby binding of the viral particle to HSP is altered compared to a particle that 
expresses an unmodified fiber. 

39. An adenoviral particle of claim 37, wherein a native receptor for 
the fiber is coxsackie-adenovirus receptor (CAR). 

40. The adenoviral particle of claim 39, further comprising a mutation 
10 in the CAR-binding region of the capsid. 

41 . The adenoviral particle of claim 39, further comprising a mutation 
in the a v integrin-binding region of the capsid, whereby binding to the integrin is 
eliminated or reduced. 

42. The adenoviral particle of claim 40, further comprising a mutation 
15 in the a v integrin-binding region of the capsid, whereby binding to the integrin is 

eliminated or reduced 

43. The adenoviral particle of claim 39, wherein the CAR-binding 
region of the capsid modified is on a fiber knob. 

44. The adenoviral particle of claim 43, wherein the fiber knob 
20 modification is in the AB loop or CD loop. 

45. The adenoviral particle of claim 44, wherein the fiber knob 
modification is selected from the group consisting of KOI and K012. 

46. The adenoviral particle of claim 39, wherein the adenovirus is a 
subgroup C, D or F adenovirus. 

25 47. The adenoviral particle of claim 46, wherein the subgroup C virus 

is Ad2 or Ad5, the subgroup D virus is Ad46 and the subgroup F virus is Ad41 . 

48. The adenoviral vector of claim 27 that is an early generation 
adenoviral vector, a gutless adenoviral vector or a replication-conditional 
adenoviral vector. 

30 49. The adenoviral vector of claim 28 that is an early generation 

adenoviral vector, a gutless adenoviral vector or a replication-conditional 
adenoviral vector. 
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50. The adenoviral vector of claim 48, wherein the replication-condi- 
tional adenoviral vector is an oncolytic adenoviral vector. 

51. The adenoviral vector of claim 49, wherein the replication-condi- 
tional adenoviral vector is an oncolytic adenoviral vector. 

5 52. The adenoviral vector of claim 27 that comprises heterologous 

nucleic acid. 

53. The adenoviral vector of claim 28 that comprises heterologous 
nucleic acid. 

54. The adenoviral vector of claim 52, wherein the heterologous 
10 nucleic acid encodes a polypeptide. 

55. The adenoviral vector of claim 53, wherein the heterologous 
nucleic acid encodes a polypeptide. 

56. The adenoviral vector of claim 52, wherein the heterologous 
nucleic acid comprises or encodes a regulatory nucleic acid. 

15 57. The adenoviral vector of claim 53, wherein the heterologous 

nucleic acid comprises or encodes a regulatory nucleic acid. 

58. The adenoviral vector of claim 52, wherein the heterologous 
nucleic acid comprises or encodes a promoter or RNA. 



20 nucleic acid comprises or encodes a promoter or RNA. 

60. The adenoviral vector of claim 59, wherein the promoter is a cell 
or tissue specific promoter. 

61. The adenoviral vector of claim 59, wherein the promoter is 
operably linked to a gene of an adenovirus essential for replication. 

25 62. The adenoviral vector of claim 60, wherein the tissue specific 

promoter is a tumor specific promoter. 

63. The adenoviral vector of claim 58, wherein the polypeptide is a 
therapeutic polypeptide. 

64. A method of expressing heterologous nucleic acid in a cell, 
30 comprising transducing the cell with an adenoviral vector of claim 57. 



59. The adenoviral vector of claim 53, wherein the 



heterologous 



65. The method of claim 64, wherein: 
the cell is a tumor cell; 
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the adenoviral vector is an oncolytic vector; and 
the cell is killed . 

66. The method of claim 64, wherein the cell is a mammalian cell. 

67. The method of claim 64, wherein the cell is a primate cell. 
5 68. The method of claim 67, wherein the cell is a human cell. 

69. A method of reducing transduction of liver cells by an adenoviral 
particle, comprising reducing or eliminating binding of the particle to heparin 
sulfate proteoglycans (HSPs) on the liver cells. 

70. A scale up method for the propagation of a detargeted adenoviral 
10 particle, comprising: 

infecting a cell capable of replicating, maturing and packaging an 
adenoviral vector with a detargeted adenoviral vector in the presence of a 
reagent that results in entry of the adenoviral particle into the cell; 

culturing the infected cell under conditions suitable for growth, spread 
1 5 and propagation of the adenoviral vector; and 

recovering the resulting adenoviral particles. 

71 . The method of claim 70, wherein the reagent is a polycation. 

72. The method of claim 71, wherein the polycation is selected from 
the group consisting of hexadimethrine bromide, polyethylenimine, protamine 

20 sulfate and poly-L-lysine. 

73. The method of claim 70, wherein the reagent is a bifunctional 
protein that binds to the adenoviral particle and to a receptor on the cell. 

74. The method of claim 73, wherein: 

the bifunctional protein is selected from the group consisting of an 
25 anti-fiber antibody ligand fusion, an anti-fiber-Fab-FGF conjugate, an 

anti-penton-antibody ligand fusion, an anti-hexon antibody ligand fusion and a 
polylysine-peptide fusion, wherein the ligand is a ligand that binds to the 
receptor. 

75. The method of any one of claims 70-74, wherein the detargeted 
30 adenoviral particle expresses a modified capsid, whereby binding to at least one 

host cell receptor is reduced or eliminated compared with a wild-type 
adenovirus. 
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76. The method of claim 75, wherein the adenoviral particle is 
modified to eliminate or reduce binding with one host cell receptor. 

77. The method of claim 75, wherein the adenoviral particle is 
modified to eliminate or reduce binding with two host cell receptors. 

5 78. The method of claim 75, wherein the adenoviral particle is 

modified to eliminate or reduce binding with three host cell receptors. 

79. The, method of claim 75, wherein the particle is modified with one 
or more mutations selected from the group consisting of mutations that reduce 
or eliminate interactions with one or more of cr v integrins, coxsackie-adenovirus 

10 receptors (CAR), and heparin sulfate proteoglycans (HSP). 

80. The method of claim 79, wherein the mutation is selected from the 
group consisting of PD1 , KOI, K012 and S*. 

81 . The modified protein of claim 2, wherein the mutation is in the 
shaft of a fiber. 

15 82. A modified protein of claim 3, wherein affinity for HSP is reduced 

at least by an amount selected from among reduced 5-fold, 10-fold and 100- 
fold. 
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SEQUENCE LISTING 
<110> The Scripps Research Institute 
Novartis AG 
Kaleko, Michael 
Nemerow, Glen R. 
Smith, Theodore 
Stevenson, Susan C. 

<120> Fiber Shaft Modifications for Efficient Targeting 

<130> 22908-1236PC 

<140> Not yet assigned 
<141> Herewith 

<150> 60/350,388 
<151> 2002-01-24 

<150> 60/391,967 
<151> 2002-06-26 

<160> 72 

<170> Patentln version 3.0 

<210> 1 

<211> 4 

<212> PRT 

<213> adenovirus serotype 5 

<400> 1 
Lys Lys Thr Lys 
1 

<210> 2 
<211> 1746 
<212> DNA 

<213> adenovirus serotype 5 
<400> 2 

atgaagcgcg caagaccgtc tgaagatacc ttcaaccccg tgtatccata 
accggtcctc caactgtgcc ttttcttact cctccctttg tatcccccaa 
gagagtcccc ctggggtact ctctttgcgc ctatccgaac ctctagttac 
atgcttgcgc tcaaaatggg caacggcctc tctctggacg aggccggcaa 
caaaatgtaa ccactgtgag cccacctctc aaaaaaacca agtcaaacat 
atatctgcac ccctcacagt tacctcagaa gccctaactg tggctgccgc 
atggtcgcgg gcaacacact caccatgcaa tcacaggccc cgctaaccgt 
aaacttagca ttgccaccca aggacccctc acagtgtcag aaggaaagct 
acatcaggcc ccctcaccac caccgatagc agtaccctta ctatcactgc 
ctaactactg ccactggtag cttgggcatt gacttgaaag agcccattta 
ggaaaactag gactaaagta cggggctcct ttgcatgtaa cagacgacct 
accgtagcaa ctggtccagg tgtgactatt aataatactt ccttgcaaac 
ggagccttgg gttttgattc acaaggcaat atgcaactta atgtagcagg 
attgattctc aaaacagacg ccttatactt gatgttagtt atccgtttga 
caactaaatc taagactagg acagggccct ctttttataa actcagccca 
attaactaca acaaaggcct ttacttgttt acagcttcaa acaattccaa 
gttaacctaa gcactgccaa ggggttgatg tttgacgcta cagccatagc 
ggagatgggc ttgaatttgg ttcacctaat gcaccaaaca caaatcccct 
attggccatg gcctagaatt tgattcaaac aaggctatgg ttcctaaact 
cttagttttg acagcacagg tgccattaca gtaggaaaca aaaataatga 
ttgtggacca caccagctcc agaggctaac tgtagactaa atgcagagaa 



tgacacggaa 


60 


tgggtttcaa 


120 


ctccaatggc 


180 


ccttacctcc 


240 


aaacctggaa 


300 


cgcacctcta 


360 


gcacgactcc 


420 


agccctgcaa 


480 


ctcaccccct 


540 


tacacaaaat 


600 


aaacactttg 


660 


taaagttact 


720 


aggactaagg 


780 


tgctcaaaac 


840 


caacttggat 


900 


aaagcttgag 


960 


cattaatgca 


1020 


caaaacaaaa 


1080 


aggaactggc 


1140 


taagctaact 


1200 


agatgctaaa 


1260 
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ctcactttgg 
gttaaaggca 
tttgacgaaa 
agaaatggag 
ctatcagctt 
tacttaaacg 
gaaacaggag 
cacaactaca 
gaataa 



tcttaacaaa 
gtttggctcc 
atggagtgct 
atcttactga 
atccaaaatc 
gagacaaaac 
acacaactcc 
ttaatgaaat 



atgtggcagt 
aatatctgga 
actaaacaat 
aggcacagcc 
tcacggtaaa 
taaacctgta 
aagtgcatac 
atttgccaca 



caaatacttg 
acagttcaaa 
tccttcctgg 
tatacaaacg 
actgccaaaa 
acactaacca 
tctatgtcat 
tcctcttaca 



ctacagtttc 
gtgctcatct 
acccagaata 
ctgttggatt 
gtaacattgt 
ttacactaaa 
tttcatggga 
ctttttcata 



agttttggct 
tattataaga 
ttggaacttt 
tatgcctaac 
cagtcaagtt 
cggtacacag 
ctggtctggc 
cattgcccaa 



<210> 3 

<211> 1746 

<212> DNA 

<213> adenovirus serotype 5 



<400> 3 

atgaagcgcg 

accggtcctc 

gagagtcccc 

atgcttgcgc 

caaaatgtaa 

atatctgcac 

atggtcgcgg 

aaacttagca 

acatcaggcc 

ctaactactg 

ggaaaactag 

accgtagcaa 

ggagccttgg 

attgattctc 

baactaaatc 

attaactaca 

gttaacctaa 

ggagatgggc 

attggccatg 

cttagttttg 

ttgtggacca 

ctcactttgg 

gttaaaggca 

tttgacgaaa 

agaaatggag 

ctatcagctt 

tacttaaacg 

gaaacaggag 

cacaactaca 

gaataa 



caagaccgtc 

caactgtgcc 

ctggggtact 

tcaaaatggg 

ccactgtgag 

ccctcacagt 

gcaacacact 

ttgccaccca 

ccctcaccac 

ccactggtag 

gactaaagta 

ctggtccagg 

gttttgattc 

aaaacagacg 

taagactagg 

acaaaggcct 

gcactgccaa 

ttgaatttgg 

gcctagaatt 

acagcacagg 

caccagctcc 

tcttaacaaa 

gtttggctcc 

atggagtgct 

atcttactga 

atccaaaatc 

gagacaaaac 

acacaactcc 

ttaatgaaat 



tgaagatacc 
ttttcttact 
ctctttgcgc 
caacggcctc 
cccacctctc 
tacctcagaa 
caccatgcaa 
aggacccctc 
caccgatagc 
cttgggcatt 
cggggctcct 
tgtgactatt 
acaaggcaat 
ccttatactt 
acagggccct 
ttacttgttt 
ggggttgatg 
ttcacctaat 
tgattcaaac 
tgccattaca 
atctcctaac 
atgtggcagt 
aatatctgga 
actaaacaat 
aggcacagcc 
tcacggtaaa 
taaacctgta 
aagtgcatac 
atttgccaca 



ttcaaccccg 

cctccctttg 

ctatccgaac 

tctctggacg 

aaaaaaacca 

gccctaactg 

tcacaggccc 

acagtgtcag 

agtaccctta 

gacttgaaag 

ttgcatgtaa 

aataatactt 

atgcaactta 

gatgttagtt 

ctttttataa 

acagcttcaa 

tttgacgcta 

gcaccaaaca 

aaggctatgg 

gtaggaaaca 

tgttcactaa 

caaatacttg 

acagttcaaa 

tccttcctgg 

tatacaaacg 

actgccaaaa 

acactaacca 

tctatgtcat 

tcctcttaca 



tgtatccata 
tatcccccaa 
ctctagttac 
aggccggcaa 
agtcaaacat 
tggctgccgc 
cgctaaccgt 
aaggaaagct 
ctatcactgc 
agcccattta 
cagacgacct 
ccttgcaaac 
atgtagcagg 
atccgtttga 
actcagccca 
acaattccaa 
cage cat age 
caaatcccct 
ttcctaaact 
aaaataatga 
atggaggcgg 
ctacagtttc 
gtgctcatct 
acccagaata 
ctgttggatt 
gtaacattgt 
ttacactaaa 
tttcatggga 
ctttttcata 



tgacacggaa 
tgggtttcaa 
ctccaatggc 
ccttacctcc 
aaacctggaa 
cgcacctcta 
gcacgactcc 
agccctgcaa 
ctcaccccct 
tacacaaaat 
aaacactttg 
taaagttact 
aggactaagg 
tgctcaaaac 
caacttggat 
aaagcttgag 
cattaatgea 
caaaacaaaa 
aggaactggc 
taagctaact 
agatgetaaa 
agttttggct 
tattataaga 
ttggaacttt 
tatgcctaac 
cagtcaagtt 
cggtacacag 
ctggtctggc 
cattgcccaa 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1746 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1746 



<210> 4 

<211> 1737 

<212> DNA 

<213> adenovirus serotype 5 



<400> 4 

atgcggcgcg 

gcggcgccag 

gtgcctccgc 

gcacccctat 

tccctgaact 

tacagcccgg 

ggcgacctga 

aataagttta 



cggcgatgta 
tggeggegge 
ggtacctgcg 
tcgacaccac 
accagaacga 
gggaggcaag 
aaaccafccct 
aggegegggt 



tgaggaaggt 
gctgggttct 
gcctaccggg 
ccgtgtgtac 
ccacagcaac 
cacacagacc 
gcataccaac 
gatggtgtcg 



cctcctccct 
cccttcgatg 
gggagaaaca 
ctggtggaca 
tttctgacca 
atcaatcttg 
atgccaaatg 
cgcttgccta 



cctacgagag 
ctcccctgga 
geatcegtta 
acaagtcaac 
eggtcattea 
acgaccggtc 
tgaacgagtt 
ctaaggacaa 



tgtggtgagc 
cccgccgttt 
ctctgagttg 
ggatgtggca 
aaacaatgac 
gcactggggc 
catgtttacc 
tcaggtggag 



60 
120 
180 
240 
300 
360 
420 
480 
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ctgaaatacg 

atagacctta 

gttctggaaa 

cccgtcactg 

attttgctgc 

atccgcaagc. 

ggtaacattc 

gaacagggcg 

tccaacgcgg 

ccctacgacg 

gaggccgaag 

cagaagaaac 

ataagcaatg 

gaccctcaga 

tcggagcagg 

acgcgccaga 

agcttctaca 

cacgtgttca 

accaccgtca 

agcatcggag 

gtttacaagg 



agtgggtgga 
tgaacaacgc 
gcgacatcgg 
gtcttgtcat 
caggatgcgg 
ggcaaccctt 
ccgcactgtt 

ggggtggcgc 

cagccgcggc 
tgcccgacta 
cagcggccga 
cggtgatcaa 
acagcacctt 
ccggaatccg 
tctactggtc 
tcagcaactt 
acgaccaggc 
atcgctttcc 
gtgaaaacgt 
gagtccagcg 
ccctgggcat 



gttcacgctg 

gatcgtggag 

ggtaaagttt 

gcctggggta 

ggtggacttc 

ccaggagggc 

ggatgtggac 

aggcggcagc 

aatgcagccg 

cgcgggcacc 

agctgccgcc 

acccctgaca 

cacccagtac 

ctcatggacc 

gttgccagac 

tccggtggtg 

cgtctactcc 

cgagaaccag 

tcctgctctc 

agtgaccatt 

agtctcgccg 



cccgagggca 

cactacttga 

gacacccgca 

tatacaaacg 

acccacagcc 

tttaggatca 

gcctaccagg 

aacagcagtg 

gtggaggaca 

agcgccacac 

cccgctgcgc 

gaggacagca 

cgcagctggt 

ctgctttgca 

atgatgcaag 

ggcgccgagc 

caactcatcc- 

attttggcgc 

acagatcacg 

actgacgcca 

cgcgtcctat 



actactccga 

aagtgggcag 

acttcagact 

aagccttcca 

gcctgagcaa 

cctacgatga 

cgagcttgaa 

gcagcggcgc 

tgaacgatag 

gggctgagga 

aacccgaggt 

agaaacgcag 

accttgcata 

ctcctgacgt 

accccgtgac 

tgttgcccgt 

gccagtttac 

gcccgccagc 

ggacgctacc 

gacgccgcac 

cgagccgcac 



gaccatgacc 

acagaacggg 

ggggtttgac 

tccagacatc 

cttgttgggc 

tctggagggt 

agatgacacc 

ggaagagaac 

ccgcggctac 

gaagcgcgct 

cgagaagcct 

ttacaaccta 

caactacggc 

aacctgcggc 

cttccgctcc 

gcactccaag 

ctctctgacc 

ccccaccatc 

gctgcgcaac 

ctgcccctac 

tttttga 



<210> 5 
<211> 20 
<212> DNA 

<213> adenovirus serotype 5 
<400> 5 

gaacaggagg tgagcttaga 

<210> 6 
<211> 43 
<212> DNA 

<213> adenovirus serotype 5 
<400> 6 

tccgcctcca tttagtgaac agttaggaga tggagctggt gtg 

<210> 7 
<211> 44 
<212> DNA 

<213> adenovirus serotype 5 
<400> 7 

tcactaaatg gaggcggaga tgctaaactc actttggtct taac 

<210> 8 
<211> 20 
<212> DNA 

<213> adenovirus serotype 5 
<400> 8 

gtggcaggtt gaatactagg 

<210> 9 

<211> 8 

<212> PRT 

<213> adenovirus serotype 5 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1737 



20 



43 



44 



20 



<400> 9 

His Ala lie Arg Gly Asp Thr Phe 
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<210> 10 

<21X> 15 

<212> PRT 

<213> Artificial Sequence 

<220> ^ . 

<223> mofified sequence for penton protean 

Ser°Arg 1 Gly Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly Thr Ser 
1 5 10 15 

<210> 11 

<211> 57 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for mutation generation 

cgcgjaagag aactccaacg cggcagccgc ggcaatgcag ccggtggagg acatgaa 57 

<210> 12 

<211> 59 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for mutation generation 

tatcgttcat gtcctccacc ggctgcattg ccgcggctgc cgcgttggag ttctcttcc 59 

<210> 13 
<211> 75 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for mutation generation 

cjatagccjc ggctacccct acgacgtgcc cgactacgcg ggcaccagcg ccacacgggc 60 
tgaggagaag cgcgc 

<210> 14 
<211> 73 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for mutation generation 

tcagcgcgct tctcctcagc ccgtgtggcg ctggtgcccg cgtagtcggg cacgtcgtag 60 
gggtagccgc ggc 

<210> 15 

<211> 40 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for mutation generation 
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<400> 15 

ggctccggct ccgagaggtg ggctcacagt ggttacattt 40 

<210> 16 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide for mutation generation 

<400> 16 

ggagccggag cctcaaacat aaacctggaa at oo 

<210> 17 32 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 17 

ctctagaaat ggacggaatt attacag 27 

<210> 18 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 18 

tcttggtcat ctgcaacaac atgaagatag tg 32 

<210> 19 

<211> 32 

<212> DMA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 19 

gttgttgcag atgaccaaga gagtccggct ca 32 

<210> 20 
<211> 73 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 
<400> 20 

agcaattgaa aaataaacac gttgaaacat aacacaaacg attctttagt tgtcatcttc so 
tgtaatgtaa gaa 3 3 

<210> 21 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 



<400> 21 
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agcaattgaa aaataaacac gttg 24 

<210> 22 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplif ication primer 

<400> 22 

gaacaggagg tgagcttaga 20 

<210> 23 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 23 

gttaggtgga gggtttattc cggtccacaa agttagctta tc 42 

<210> 24 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 24 

gataagctaa ctttgtggac cggaataaac cctccaccta ac 42 

<210> 25 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 25 

3tggcaggtt gaatactagg 20 

<210> 26 

<211> 41 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 26 

gttaggagat ggagctggtg tagtccataa ggtgttaata c 41 

<210> 27 

<211> 41 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 



<400> 27 

gtattaacac cttatggact acaccagctc catctcctaa c 



41 
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<210> 28 

<211> 54 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 28 

tgcgcaaaaa caatcaccac gacaatcaca atgtacattg gaagaaatca tacg 54 

<210> 29 

<211> 54 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 



<210> 30 

<211> 10 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> RGD targeting peptide 

<400> 30 

His Cys Asp Cys Arg Gly Asp Cys Phe Cys 
15 10 

<210> 31 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 31 

ttcttttcat ctgcaacaac atgaagatag tg 32 

<210> 32 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 



<210> 33 

<211> 73 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 



<400> 29 

acattgtgat tgtcgtggtg attgtttttg cgcatatgcc atacaatttg aatg 



54 



<400> 32 

gttgttgcag atgaaaagaa ccagaattga ag 



32 



<400> 33 

tgcaattgaa aaataaacac gttgaaacat aacacaaacg attctttatt cttcagttat 
gtagcaaaat aca 



60 
73 
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<210> 34 

<211> 56 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 34 

agtacaaaaa caatcaccac gacaatcaca gtttatctcg ttgtagacga cactga . 56 

<210> 35 

<211> 51 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 35 

tgtgattgtc gtggtgattg tttttgtact agtgggtatg cttttacttt t 51 

<210> 36 

<211> 4 

<212> PRT 

<213> Adenovirus type 5 



<400> 36 

Thr Leu Trp Thr 

1 



<210> 37 

<211> 7 

<212> PRT 

<213> SV40 



<400> 37 

Pro Lys Lys Lys Arg Lys Val 
1 5 



<210> 38 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 38 

cttcgatgat gccgcagtg 19 

<210> 39 

<211> 19 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> amplification primer 

<400> 39 

gggctcaggt actccgagg 19 

<210> 40 

<211> 25 

<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> amplification primer 
<400> 40 

ttacatgcac atctcgggcc aggac 25 

<210> 41 
<211> 7607 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Plasmid GRE5-El-SV40-Hygro 
<400> 41 

tctagaagat ccgctgtaca ggatgttcta gctactttat tagatccgct gtacaggatg 60 
ttctagctac tttattagat ccgctgtaca ggatgttcta gctactttat tagatccgct 120 
gtacaggatg ttctagctac tttattagat ccgtgtacag gatgttctag ctactttatt 180 
agatcgatct cctggccgtt cggggtcaaa aaccaggttt ggctataaaa gggggtgggg 240 
gcgcgttcgt cctcactctc ttccgcatcg ctgtctgcga gggccaggat cgatcctgag 300 
aacttcaggg tgagtttggg gacccttgat tgttctttct ttttcgctat tgtaaaattc 360 
atgttatatg gagggggcaa agttttcagg gtgttgttta gaatgggaag atgtcccttg 420 
tatcaccatg gaccctcatg ataattttgt ttctttcact ttctactctg ttgacaacca 480 
ttgtctcctc ttattttctt ttcattttct gtaacttttt cgttaaactt tagcttgcat 540 
ttgtaacgaa tttttaaatt cacttttgtt tatttgtcag attgtaagta ctttctctaa 600 
tcactttttt ttcaaggcaa tcagggtata ttatattgta cttcagcaca gttttagaga 660 
acaattgtta taattaaatg ataaggtaga atatttctgc atataaattc tggctggcgt 720 
ggaaatattc ttattggtag. aaacaactac atcctggtca tcatcctgcc tttctcttta 780 
tggttacaat gatatacact gtttgagatg aggataaaat actctgagtc caaaccgggc 840 
ccctctgcta accatgttca tgccttcttc tttttcctac agctcctggg caacgtgctg 900 
gttattgtgc tgtctcatca ttttggcaaa gaattagatc taagcttctg cagctcgagg 960 
actcggtcga ctgaaaatga gacatattat ctgccacgga ggtgttatta ccgaagaaat 102 0 
ggccgccagt cttttggacc agctgatcga agaggtactg gctgataatc ttccacctcc 1080 
tagccatttt gaaccaccta cccttcacga actgtatgat ttagacgtga cggcccccga 114 0 
agatcccaac gaggaggcgg tttcgcagat ttttcccgac tctgtaatgt tggcggtgca 1200^ 
ggaagggatt gacttactca cttttccgcc ggcgcccggt tctccggagc cgcctcacct 1260 
ttcccggcag cccgagcagc cggagcagag agccttgggt ccggtttcta tgccaaacct 1320 
tgtaccggag gtgatcgatc ttacctgcca cgaggctggc tttccaccca gtgacgacga 1380 
ggatgaagag ggtgaggagt ttgtgttaga ttatgtggag caccccgggc acggttgcag 1440 
gtcttgtcat tatcaccgga ggaatacggg ggacccagat attatgtgtt cgctttgcta 1500 
tatgaggacc tgtggcatgt ttgtctacag taagtgaaaa * ttatgggcag tgggtgatag 1560 
agtggtgggt ttggtgtggt aatttttttt ttaattttta cagttttgtg gtttaaagaa 162 0 
ttttgtattg tgattttttt aaaaggtcct gtgtctgaac ctgagcctga gcccgagcca 1680 
gaaccggagc ctgcaagacc tacccgccgt cctaaaatgg cgcctgctat cctgagacgc 1740 
ccgacatcac ctgtgtctag agaatgcaat agtagtacgg atagctgtga ctccggtcct 1800 
tctaacacac ctcctgagat acacccggtg gtcccgctgt gccccattaa accagttgcc 1860 
gtgagagttg gtgggcgtcg ccaggctgtg gaatgtatcg aggacttgct taacgagcct 1920 
gggcaacctt tggacttgag ctgtaaacgc cccaggccat aaggtgtaaa cctgtgattg 1980 
cgtgtgtggt taacgccttt gtttgctgaa tgagttgatg taagtttaat aaagggtgag 2040 
ataatgttta acttgcatgg cgtgttaaat ggggcggggc ttaaagggta tataatgcgc 210 0 
cgtgggctaa tcttggttac atctgacctc atggaggctt gggagtgttt ggaagatttt 2160 
tctgctgtgc gtaacttgct ggaacagagc tctaacagta cctcttggtt ttggaggttt 2220 
ctgtggggct catcccaggc aaagttagtc tgcagaatta aggaggatta caagtgggaa 2280 
tttgaagagc ttttgaaatc ctgtggtgag ctgtttgatt ctttgaatct gggtcaccag 2340 
gcgcttttcc aagagaaggt catcaagact ttggattttt ccacaccggg gcgcgctgcg 2400 
gctgctgttg cttttttgag ttttataaag gataaatgga gcgaagaaac ccatctgagc 2460 
SSrggggtacc tgctggattt tctggccatg catctgtgga gagcggttgt gagacacaag 2520 
aatcgcctgc tactgttgtc ttccgtccgc ccggcgataa taccgacgga ggagcagcag 2580 
cagcagcagg aggaagccag gcggcggcgg caggagcaga gcccatggaa cccgagagcc 2640 
ggcctggacc ctcgggaatg aatgttgtac aggtggctga actgtatcca gaactgagac 2700 
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gcattttgac aattacagag gatgggcagg ggctaaaggg ggtaaagagg gagcgggggg 2760 
cttgtgaggc tacagaggag gctaggaatc tagcttttag cttaatgacc agacaccgtc 2820 
ctgagtgtat tacttttcaa cagatcaagg ataattgcgc taatgagctt gatctgctgg 2880 
cgcagaagta ttccatagag cagctgacca cttactggct gcagccaggg gatgattttg 2940 
aggaggctat tagggtatat gcaaaggtgg cacttaggcc agattgcaag tacaagatca 3 000 
gcaaacttgt aaatatcagg aattgttgct acatttctgg gaacggggcc gaggtggaga 3 060 
tagatacgga ggatagggtg gcctttagat gtagcatgat aaatatgtgg ccgggggtgc 3120 
ttggcatgga cggggtggtt attatgaatg taaggtttac tggccccaat tttagcggta 3180 
cggttttcct ggccaatacc aaccttatcc tacacggtgt aagcttctat gggtttaaca 3240 
atacctgtgt ggaagcctgg accgatgtaa gggttcgggg ctgtgccttt tactgctgct 3300 
ggaagggggt ggtgtgtcgc cccaaaagca gggcttcaat taagaaatgc ctctttgaaa 3360 
ggtgtacctt gggtatcctg tctgagggta actccagggt gcgccacaat gtggcctccg 3420 
actgtggttg cttcatgcta gtgaaaagcg tggctgtgat taagcataac atggtatgtg 3480 
gcaactgcga ggacagggcc tctcagatgc tgacctgctc ggacggcaac tgtcacctgc 3540 
tgaagaccat tcacgtagcc agccactctc gcaaggcctg gccagtgttt gagcataaca 3600 
tactgacccg ctgttccttg catttgggta acaggagggg ggtgttccta ccttaccaat 3660 
gcaatttgag tcacactaag atattgcttg agcccgagag catgtccaag gtgaacctga 3720 
acggggtgtt tgacatgacc atgaagatct ggaaggtgct gaggtacgat gagacccgca 3780 
ccaggtgcag accctgcgag tgtggcggta aacatattag gaaccagcct gtgatgctgg 3 840 
atgtgaccga ggagctgagg cccgatcact tggtgctggc ctgcacccgc gctgagtttg 3900 
gctctagcga tgaagataca gattgaggta ctgaaatgtg tgggcgtggc ttaagggtgg 3960 
gaaagaatat ataaggtggg ggtcttatgt agttttgtat ctgttttgca gcagccgccg 4020 
ccgccatgag caccaactcg tttgatggaa gcattgtgag ctcatatttg acaacgcgca 4080 
tgcccccatg ggccggggtg cgtcagaatg tgatgggctc cagcattgat ggtcgccccg 4140 
tcctgcccgc aaactctact accttgacct acgagaccgt gtctggaacg ccgttggaga 4200 
ctgcagcctc cgccgccgct tcagccgctg cagccaccgc ccgcgggatt gtgactgact 4260 
ttgctttcct gagcccgctt gcaagcagtg cagcttcccg ttcatccgcc cgcgatgaca 4320 
agttgacggc tcttttggca caattggatt ctttgacccg ggaacttaat gtcgtttctc 4380 
agcagctgtt ggatctgcgc cagcaggttt ctgccctgaa ggcttcctcc cctcccaatg 4440 
cggtttaaaa cataaataaa aaaccagact ctgtttggat ttggatcaag caagtgtctt 4500 
gctgtctcag ctgactgctt aagtcgcaag ccgaattgga tccaattcgg atcgatctta 4560 
ttaaagcaga acttgtttat tgcagcttat aatggttaca aataaagcaa tagcatcaca 4620 
aatttcacaa ataaagcatt tttttcactg cattctagtt gtggtttgtc caaactcatc 4680 
aatgtatctt atcatgtctg gtcgactcta gactcttccg cttcctcgct cactgactcg 4740 
ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg 4800 
ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag 4860 
gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac 4920 
gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga 4980 
taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt 5040 
accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca tagctcacgc 5100 
tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc 5160 
cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta 5220 
agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat 5280 
gtaggcggtg ctacagagtt cttgaagtgg tggcctaact acggctacac tagaaggaca 5340 
gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct 5400 
tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt 5460 
acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct 5520 
cagtggaacg aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc 5580 
acctagatcc ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa 5640 
acttggtctg acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta 5700 
tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc 5760 
ttaccatctg gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat 582 0 
ttatcagcaa taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta 5880 
tccgcctcca tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt 5940 
aatagtttgc gcaacgttgt tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt 6000 
ggtatggctt cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg 6060 
ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc 6120 
gcagtgttat cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc 6180 
gtaagatgct tttctgtgac tggtgagtab tcaaccaagt cattctgaga atagtgtatg 6240 
cggcgaccga gttgctcttg cccggcgtca atacgggata ataccgcgcc acatagcaga 6300 
acttfcaaaag tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta 6360 
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ccgctgttga gatccagttc gatgtaaccc 
tttactttca ccagcgtttc tgggtgagca 
ggaataaggg cgacacggaa atgttgaata 
agcatttatc agggttattg tctcatgagc 
aaacaaatag gggttccgcg cacatttccc 
attattatca tgacattaac ctataaaaat 
gcgtttcggt gatgacggtg aaaacctctg 
ttgtctgtaa gcggatgccg ggagcagaca 
cgggtgtcgg ggctggctta actatgcggc 
tatgcggtgt gaaataccgc acagatgcgt 
agcgttaata ttttgttaaa attcgcgtta 
caataggccg aaatcggcaa aatcccttat 
agtgttgttc cagtttggaa caagagtcca 
gggcgaaaaa ccgtctatca gggcgatggc 
tttttggggt cgaggtgccg taaagcacta 
agagcttgac ggggaaagcc ggcgaacgtg 
gcgggcgcta gggcgctggc aagtgtagcg 
gcgcttaatg cgccgctaca gggcgcgtcc 
gaagggcgat cggtgcgggc ctcttcgcta 
gcaaggcgat taagttgggt aacgccaggg 
gccagtgaat tgtaatacga ctcactatag 

<210> 42 
<211> 11600 
<212> DNA 

<213> Artificial Sequence 



actcgtgcac ccaactgatc ttcagcatct 6420 
aaaacaggaa ggcaaaatgc cgcaaaaaag 6480 
ctcatactct tcctttttca atattattga 654 0 
ggatacatat ttgaatgtat ttagaaaaat 6600 
cgaaaagtgc cacctgacgt ctaagaaacc 6660 
aggcgtatca cgaggcccct ttcgtctcgc 6720 
acacatgcag ctcccggaga cggtcacagc 6780 
agcccgtcag ggcgcgtcag cgggtgttgg 6840 
atcagagcag attgtactga gagtgcacca 6900 
aaggagaaaa taccgcatca ggaaattgta 6960 
aatttttgtt aaatcagctc attttttaac 7020 
aaatcaaaag aatagaccga gatagggttg 7080 
ctattaaaga acgtggactc caacgtcaaa 7140 
ccactacgtg aaccatcacc ctaatcaagt 7200 
aatcggaacc ctaaagggag cccccgattt 7260 
gcgagaaagg aagggaagaa agcgaaagga 7320 
gtcacgctgc gcgtaaccac cacacccgcc 7380 
cattcgccat tcaggctgcg caactgttgg 7440 
ttacgccagc tggcgaaagg gggatgtgct 7500 
ttttcccagt cacgacgttg taaaacgacg 7560 
ggcgaattaa ttcgggg 7607 



<220> 

<223> Plasmid MMTV-E2a-SV40-Neo 



<400> 42 

gaattccgca ttgcagagat attgtattta 
tgaccattca ccacattggt gtgcacctcc 
gagtgtccta cacctagggg agaagcagcc 
tctgcgcaca aacggatgag cccatcagac 
ggcatagctc tgctttgcct ggggctattg 
ctcacccttg actcttttaa tagctcttct 
aactcgacct tcctcctgag gcaaggacca 
ttttgtcctt cagaaataga aataagaatg 
accaatccaa taggtagatt attagttact 
actattttta ctcaaattca gaagttagaa 
acctcaattg aagaacaggt gcaaggacta 
aaaaagagtg tttttgtcaa aataggagac 
ccttacatct acagaccaac agatgccccc 
tgggataggt gggttacagt caatggctat 
gaaagactcg ccagagctag acctccttgg 
atgaaacaac aggtacatga ttatatttat 
attttccata ccaaggaggg gacagtggct 
catggcatga gttattatga atagccttta 
agtaagtttt tggttacaaa ctgttcttaa 
cttggtttgg tatcaaaggt tctgatctga 
ttggaattta tccaaatctt atgtaaatgc 
attttttgag taaacttgca acagtcctaa 
gccatcccgt ctccgctcgt cacttatcct 
cggcgaccct caggtcggcc gactgcggca 
gtgacccttg tctctatttc tactatttgg 
ggctatcatc acaagagcgg aacggactca 
cgtccaagac cctcaaagat ttttggcact 
gcgccctgcc gcaaggccag ctgcttgtcc 
ggtatcttgc agttttggaa aaagatgtga 
acggggtaga agttgaggcg cgggttgggc 



agtgcctagc tcgatacaat aaacgccatt 60 
aagcttgggc agaaatggtt gaactcccga 120 
aaggggttgt ttcccaccaa ggacgacccg 180 
aaagacatat tcattctctg ctgcaaactt 240 
ggggaagttg cggttcgtgc tcgcagggct 300 
gtgcaagatt acaatctaaa caattcggag 360 
cagccaactt cctcttacaa gccgcatcga 420 
cttgctaaaa attatatttt taccaataag 480 . 
atgttaagaa atgaatcatt- atcttttagt 540 
atgggaatag aaaatagaaa gagacgctca 600 
ttgaccacag gcctagaagt aaaaaaggga 66 D 
aggtggtggc aaccagggac ttatagggga 720 
ttaccatata caggaagata tgacttaaat 780 
aaagtgttat atagatccct cccttttcgt 840 
tgtatgttgt ctcaagaaga aaaagacgac 900 
ctaggaacag gaatgcactt ttggggaaag 960 
ggactaatag aacattattc tgcaaaaact 1020 
ttggcccaac cttgcggttc ccagggctta 1080 
aacgaggatg tgagacaagt ggtttcctga 1140 
gctctgagtg ttctattttc ctatgttctt 1200 
ttatgtaaac caagatataa aagagtgctg 1260 
cattcacctc ttgtgtgttt gtgtctgttc 1320 
tcactttcca gagggtcccc ccgcagaccc 1380 
gctggcgccc gaacagggac cctcggataa 1440 
tgtttgtctt gtattgtctc tttcttgtct 1500 
ccatagggac caagctagcg cttctcgtcg 1560 
tcgttgagcg aggcgatatc aggtatgaca 1620 
gctcggctgc ggttggcacg gcaggatagg 1680 
taggtggcaa gcacctctgg cacggcaaat 1740 
tcgcatgtgc cgttttcttg gcgtttgggg 1800 
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ggtacgcgcg 

ggcacatcgc 

ttcaacagca 

acttgttcct 

gagcggtcct 

tcctcaagcg 

gcggtggtgg 

atctttttct 

acccccgagc 

ccgtccccgt 

gagtccgagg 

cggccatcga 

tctcccaagc 

gaagatgtgg 

aaaggaggta 

cggacgcaag 

ccgctgagtg 

atggacaagt 

gtggaagctc 

ctgaccttca 

ctgcagtcgt 

ctgcaccgct 

aataaggagc 

gage agt eta 

tccaacaccg 

ggcaagtctt 

atcaaggctt 

atgccactac 

ctaccaaagt 

tccgacaaga 

aaccctgtgt 

tcggcgcccg 

accgagctgc 

aacgtgtccc 

gegcagaegg 

cctttattga 

gtggcgctcc 

ctgtagagct 

gagttgggta 

cagttggcaa 

ccctgtacgc 

gcgctacggc 

gaaaagccaa 

atccagtgta 

tgtggagaaa 

ccaagatctt 

gttgatgttg 

acaeggttte 

acaaaacata 

ttacctcagg 

aattggacaa 

tataatgtgt 

ctgatgaatg 

aaatgecate 

agaagagaaa 

atgctgtgtt 

ctgcactgct 

acagttataa 

ttaataacta 

aggaatattt 

gaggttttac 



gtgagaatag 

tgcgctcttg 

cgtcgtctcc 

cgtttgcctc 

cgtcgtcttc 

ggggtgcctc 

cgaactcaga 

gectatagga 

geggacgegg 

cgccgccgcc 

acgaggaaga 

cctcggcggc 

ccgagcgccc 

cgctacaaat 

agegcacagt 

aggaagagga 

tgccgatcgt 

accacgtgga 

tggcggccgt 

ccagcaacaa 

ttgcagaggt 

gegctgagat 

acgtgattga 

geaaggecaa 

aegcaaggtg 

gcggcatgtt 

ttatgeagge 

ggtgcgagtg 

tgactccgtt 

gcgtgctggc 

atcgcaactc 

acctgetaaa 

cgcggatggt 

tgccagtggc 

caagggtggg 

aagtgtctct 

taatctgege 

gttcctggtt 

ccccggtaat 

aggcgtggag 

tttgggtgga 

gcaggagtga 

agggctcaaa 

cgcccccagt 

caaagectgg 

tgacaatggc 

ettgettett 

gatgacgecg 

aagaagggtg 

tcgacctcga 

actacctaca 

taaactactg 

ggagcagtgg 

tagtgatgat 

ggtagaagac 

tagtaataga 

atacaagaaa 

tcataacata 

tgctcaaaaa 

gatgtatagt 

ttgctttaaa 



gtggcgttcg 
caacgcgtcg 
cacatctagg 
tgcgttgtcc 
gcttacaaaa 
gaeggggaag 

gggggcggtt 

gaaggaaatg 

tgeggegega 

tccccgggcg 

ctcatcacaa 

ggatttggcc 

gccatcacca 

ggtgggtttc 

gcggcggctg 

agagcccagc 

gtctgcgtgg 

taacgatcta 

atgeaagace 

gacctttgtg 

gacctacaag 

egaaggegag 

aatggatgtg 

gatcgtgaag 

ctgcgtgcac 

cttctctgaa 

gctgtatcct 

caactcaaag 

cgccctgagc 

cagcgtgcac 

gcgcgcgcag 

cgcgttggtg 

tgtgcctgag 

geatagegat 

ggtaaataat 

agt ac at tat 

actgtggctg 

gegaegcagg 

aaggttcatg 

aaacatgcag 

cttttccagc 

ctcgtactca 

gaggtagcat 

ctcgcgaccg 

aaagcgcttg 

tttcagttcc 

tatgttgtgg 

cggtgcggcc 

ggctcgtcca 

gggatctttg 

gagatttaaa 

attctaattg 

tggaatgect 

gaggctactg 

cccaaggact 

actcttgett 

attatggaaa 

ctgttttttc 

ttgtgtacct 

gecttgacta 

aaacctccca 



taggcaaggc 

cagataatgg 

tagtcgecat 

tggtcttget 

cctgggtcct 

gtggtaggcg 

aggctgtcct 

gecagteggg 

cgtcccccaa 

cccccaaaaa 

gaegegctgg 

attgcgccca 

gaggtaatcg 

agcaacccac 

aatgaagacg 

gaagcggaaa 

gagaagggca 

aaggegaact 

tggctgaacg 

acgatgatgg 

catcacgagc 

cttaagtgtc 

a egagegaaa 

aaccggtggg 

gacgcggcct 

ggegcaaagg 

aacgcccaga 

cctgggcacg 

aacgeggagg 

cacccggcgc 

ggeggaggee 

atggtgcgca 

tttaagtgga 

gegeggcaga 

cacccgagag 

ttttacatgt 

eggaagtagg 

gtgggctgta 
gtggggttgt 
cagaatagtc 
gttatacagc 
aactggtaaa 
gtttttgagt 
geegtattga 
tcataggtgc 
tgctcactgg 
cgttgccggc 
ggtgcacacg 
tgggatccat 
tgaaggaacc 
gctctaaggt 
tttgtgtatt 
ttaatgagga 
ctgactctca 
ttccttcaga 
getttgetat 
aatattctgt 
ttactccaca 
ttagcttttt 
gagatcataa 
cacctccccc 



tgacatccgc 

cgcactggcg 

gcctttcgtc 

ttttatcctc 

gctcgataat 

cgttggcggc 

tcttctcgac 

aagaggagca 

ccatggagga 

ageggatgag 

tgccgcgcac 

agaagaaaaa 

tggacagega 

cggtgctaat 

acccagtggc 

gtgaaattac 

tggaggctgc 

tcaaactact 

aggagcaccg 

ggegattect 

ccacgggctg 

tacaeggaag 

aegggcageg 

gecgaaatgt 

gtccggccaa 

ctcaggtggc 

ccgggcacgg 

cgcccttttt 

acctggacgc 

tgatagtgtt 

ccaactgcga 

gcctgtggag 

gcactaaaca 

acccctttga 

tgtacaaata 

ttttcaagtg 

gcgagtggcg 

cctggggact 

gatccatggg 

cacaggcggc 

ggtcggggga 

ectgettgag 

gcgggttcca 

etatggegea 
ccaaaaaata 
agcccatggc 
cgagaagggc 
gaccacgtca 
atatagggee 
ttacttctgt 
aaatataaaa 
ttagattcca 
aaacctgttt 
acattctact 
attgetaagt 
ttacaccaca 
aacctttata 
caggcataga 
aatttgtaaa 
tcagccatac 
tgaacctgaa 



tatggegagg 1860 
ctgcagatgc 1920 
cccccgcccg 1980 
tgttggtact 2040 
cacttcctcc 2100 
atcggtggag 2160 
tgactccatg 2220 
gcgcgaaacc 2280 
cgtgtcgtcc 2340 
geggegtate 2400 
acccagcccg 2460 
gaagcgccct 2520 
ggaagaaaga 2580 
caagcatggc 2640 
gcgtggtatg 2700 
ggtgatgaac 2760 
gcgcgcgctg 2820 
gcctgaccaa 2880 
cgggttgcag 2940 
geaggegtae 3000 
cgcgttgtgg 3060 
cattatgata 3120 
cgcgctgaag 3180 
ggtgcagatc 3240 
tcagttttcc 3300 
ttttaagcag 3360 
tcaccttttg 3420 
gggaaggcag 3480 
ggatctgatc 3540 
ccagtgctgc 3600 
cttcaagata 3660 
tgaaaacttc 3720 
ccagtatcgc 3780 
tttttaaacg 3840 
aaagcatttg 3900 
acaaaaagaa 3960 
ctccaggaag 4020 
gttgagcatg 4080 
agtttggggc 4140 
cgagttgggc 4200 
agaagcaatg 4260 
tegctggtea 4320 
ggcaaaggee 4380 
ggcgagcttg 4440 
tggcccacaa 4500 
ggcagctgtt 4560 
gtgcgcaggt 4620 
aagacttcaa 4680 
cgggttataa 4740 
ggtgtgacat 4800 
tttttaagtg 4860 
acctatggaa 4920 
tgctcagaag 4980 
cctccaaaaa 5040 
tttttgagtc 5100 
aaggaaaaag 5160 
agtaggcata 5220 
gtgtctgcta 5280 
ggggttaata 5340 
cacatttgta 5400 
acataaaatg 5460 
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aatgcaattg 

agcatcacaa 

aaactcatca 

tgtggaaagt 

tcagcaacca 

catctcaatt 

ccgcccagtt 

gccgaggccg 

ctaggctttt 

ggaagcggaa 

gctactgggc 

gtgggcttac 

tgccagctgg 

.tcttgccgcc 

gatcgtttcg 

agaggctatt 

tccggctgtc 

tgaatgaact 

gcgcagctgt 

tgccggggca 

ctgatgcaat 

cgaaacatcg 

atctggacga 

gcatgcccga 

tggtggaaaa 

gctatcagga 

ctgaccgctt 

atcgccttct 

gacgcccaac 

cttcggaatc 

ggagttcttc 

gctggacgac 

cagcggctat 

tttggtcccg 

tacctacaga 

aactactgat 

agcagtggtg 

gtgatgatga 

tagaagaccc 

gtaatagaac 

acaagaaaat 

ataacatact 

ctcaaaaatt 

tgtatagtgc 

gctttaaaaa 

gttgttaact 

ttcacaaata 

gtatcttatc 

ctctacttga 

aattaggtca 

ggtaatttta 

cagcccacaa 

accctgctca 

attttcccca 

aacccagcac 

catctgctga 

cctgtagttt 

aaaatttgac 

atgcaagttt 

catcaaaata 

cgaaagggcc 



ttgttgttaa 

atttcacaaa 

atgtatctta 

ccccaggctc 

ggtgtggaaa 

agtcagcaac 

ccgcccattc 

cctcggcctc 

gcaaaaagct 

cacgtagaaa 

tatctggaca 

atggcgatag 

ggcgccctct 

aaggatctga 

catgattgaa 

cggctatgac 

agcgcagggg 

gcaggacgag 

gctcgacgtt 

ggatctcctg 

gcggcggctg 

catcgagcga 

agagcatcag 

cggcgaggat 

tggccgcttt 

catagcgttg 

cctcgtgctt 

tgacgagttc 

ctgccatcac 

gttttccggg 

gcccaccccg 

ctcgcggagt 

ccgcgcatcc 

gatctttgtg 

gatttaaagc 

tctaattgtt 

gaatgccttt 

ggctactgct 

caaggacttt 

tcttgcttgc 

tatggaaaaa 

gttttttctt 

gtgtaccttt 

cttgactaga 

acctcccaca 

tgtttattgc 

aagcattttt 

atgtctggat 

gaggacattc 

cttaacaaaa 

aaatatctgg 

atgtcaacag 

tcaagaagca 

cctgtgtagg 

tccactggat 

ctgtcaactg 

gctaacacac 

ccttgaatgg 

aacatagcag 

tttccacagg 

tcgtgatacg 



cttgtttatt 
taaagcattt 
tcatgtctgg 
cccagcaggc 
gtccccaggc 
catagtcccg- 
tccgccccat 
tgagctattc 
tcacgctgcc 
gccagtccgc 
agggaaaacg 
ctagactggg 
ggtaaggttg 
tggcgcaggg 
caagatggat 
tgggcacaac 
cgcccggttc 
gcagcgcggc 
gtcactgaag 
tcatctcacc 
catacgcttg 
gcacgtactc 
gggctcgcgc 
ctcgtcgtga 
tctggattca 
gctacccgtg 
tacggtatcg 
ttctgagcgg 
gagatttcga 
acgccggctg 
ggctcgatcc 
tctaccggca 
atgcccccga 
aaggaacctt 
tctaaggtaa 
tgtgtatttt 
aatgaggaaa 
gactctcaac 
ccttcagaat 
tttgctattt 
tattctgtaa 
actccacaca 
agctttttaa 
gatcataatc 
cctccccctg 
agcttataat 
ttcactgcat 
ccccaggaag 
caatcatagg 
aggaaattgg 
gaagtccctt 
cagaaacata 
ctgtggttgc 
ttccaaaata 
aagcattatc 
tagcattttt 
cctgcagctc 
gttttccagc 
ttaccccaat 
ttaagtcctc 
cctattttta 



gcagcttata 
ttttcactgc 
atccggctgt 
agaagtatgc 
tccccagcag 
cccctaactc 
ggctgactaa 
cagaagtagt 
gcaagcactc 
agaaacggtg 
caagcgcaaa* 
cggttttatg 
ggaagccctg 
gatcaagatc 
tgcacgcagg 
agacaatcgg 
tttttgtcaa 
tatcgtggct 
cgggaaggga 
ttgctcctgc 
atccggctac 
ggatggaagc 
cagccgaact 
cccatggcga 
tcgactgtgg 
atattgctga 
ccgctcccga 
gactctgggg 
ttccaccgcc 
gatgatcctc 
cctcgcgagt 
gtgcaaatcc 
actgcaggag 
acttctgtgg 
atataaaatt 
agattccaac 
acctgttttg 
attctactcc 
tgctaagttt 
acaccacaaa 
cctttataag 
ggcatagagt 
tttgtaaagg 
agccatacca 
aacctgaaac 
ggttacaaat 
tctagttgtg 
ctcctctgtg 
ctgcccatcc 
gtaggggttt 
ccactgctgt 
caagctgtca 
tgtgttagta 
tctagtgttt 
cttatccaaa 
tggggttaca 
caaaggttcc 
accattttca 
aacctcagtt 
atttaaatta 
taggttaatg 



atggttacaa 
attctagttg 
ggaatgtgtg 
aaagcatgca 
gcagaagtat 
cgcccatccc 
ttttttttat 
gaggaggctt 
agggcgcaag 
ctgaccccgg 
gagaaagcag 
gacagcaagc 
caaagtaaac 
tgatcaagag 
ttctccggcc 
ctgctctgat 
gaccgacctg 
ggccacgacg 
ctggctgcta 
cgagaaagta 
ctgcccattc 
cggtcttgtc 
gttcgccagg 
tgcctgcttg 
ccggctgggt 
agagcttggc 
ttcgcagcgc 
ttcgaaatga 
gccttctatg 
cagcgcgggg 
tggttcagct 
gtcggcatcc 
tggggaggca 
tgtgacataa 
tttaagtgta 
ctatggaact 
ctcagaagaa 
tccaaaaaag 
tttgagtcat 
ggaaaaagct 
taggcataac 
gtctgctatt 
ggttaataag 
catttgtaga 
ataaaatgaa 
aaagcaatag 
gtttgtccaa 
tcctcataaa 
accctctgtg 
ttcacagacc 
gttccagaag 
gctttgcaca 
atgtgcaaaa 
tcatttttac 
acagccttgt 
gtttgagcag 
ccaccaacag 
tgagtttttt 
ttaacagtaa 
ggcaaaggaa 
tcatgataat 



ataaagcaat 
tggtttgtcc 
tcagttaggg 
tctcaattag 
gcaaagcatg 
gcccctaact 
ttatgcagag 
ttttggaggc 
ggctgctaaa 
atgaatgtca 
gtagcttgca 
gaaccggaat 
tggatggctt 
acaggatgag 
gcttgggtgg 
gccgccgtgt 
tccggtgccc 
ggcgttcctt 
ttgggcgaag 
tccatcatgg 
gaccaccaag 
gatcaggatg 
ctcaaggcgc 
ccgaatafcca 
gtggcggacc 
ggcgaatggg 
atcgccttct 
ccgaccaagc 
aaaggttggg 
atctcatgct 
gctgcctgag 
aggaaaccag 
cgatggccgc 
ttggacaaac 
taatgtgtta 
gatgaatggg 
atgccatcta 
aagagaaagg 
gctgtgttta 
gcactgctat 
agttataatc 
aataactatg 
gaatatttga 
ggttttactt 
tgcaattgtt 
catcacaaat 
actcatcaat 
ccctaacctc 
tcctcctgtt 
gctttctaag 
tgttggtaaa 
agggcccaac 
caggaggcac 
ttggatcagg 
ggtcagtgtt 
gatatttggt 
caaaaaaatg 
gtgtccctga 
cagcttccca 
ttcttgaaga 
aatggtttct 



5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 



WO 03/062400 



PCT/US03/02295 



-14- 



taqacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg tttatttttc 9180 
talatacatt caaatatgta tccgctcatg agacaataac cctgataaat gcttcaataa 9240 
tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat tccctttttt 9300 
qcgqcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt aaaagatgct 9360 
aaaaatcaat tqqgtgcacg agtgggttac atcgaactgg atctcaacag cggtaagatc 9420 
cttgagagtt Scfccccgl alallgtttt ccaltgatga gcacttttaa agttctgcta 9480 
tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc aactcggtcg ccgcatacac 9540 
tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct tacggatggc 9600 
atqacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac tgcggccaac 9660 
ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca caacatgggg 9720 
qatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat accaaacgac 9780 
gagcgtgaca ccacgatgcc tgcagcaatg gcaacaacgt tgcgcaaact attaactggc 9840 
qaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc ggataaagtt 9900 
qcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga taaatctgga 9960 
gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg taagccctcc 10020 
cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg aaatagacag 10080 
atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca agtttactca 10140 
tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta ggtgaagatc 10200 
cttt ttgata atctcatgac caaaatccct taacgtgagt tttcgttcca ctgagcgtca 10260 
gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg cgtaatctgc 10320 
tqcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga tcaagagcta 10380 
ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa tactgtcctt 10440 
ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc tacatacctc 10500 
gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg tcttaccggg 10560 
ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac ggggggttcg 10620 
tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct. acagcgtgag 10680 
ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc ggtaagcggc 10740 
agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg gtatctttat 10800 
altcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg ctcgtcaggg 10860 
glgcSgcl tltggalaaa cgcoagcaac gcggcctttt tacggttcct ggccttttgc 10920 
tlgccttttg ctcacatgtt ctttcctgcg ttatcccctg attctgtgga taaccgtatt 10980 
accgcctttg agtgagctga taccgctcgc cgcagccgaa cgaccgagcg cagcgagtca 11040 
gtgagcgagg aagcggaaga gcgcctgatg cggtattttc tccttacgca tctgtgcggt 11100 
Itttcacacc gcatatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc 11160 
cagtatctgc tccctgcttg tgtgttggag gtcgctgagt agtgcgcgag caaaatttaa 11220 
gctacaacaa ggcaaggctt gaccgacaat tgcatgaaga atctgcttag ggttaggcgt 11280 
tttgcgctgc ttcgcgatgt acgggccaga tatacgcgta tctgagggga ctagggtgtg 11340 
tttaggcgaa aagcggggct tcggttgtac gcggttagga gtcccctcag gatatagtag 11400 
tttcgctttt gcatagggag ggggaaatgt agtcttatgc aatacacttg tagtcttgca 11*60 
acatggtaac gatgagttag caacatgcct tacaaggaga gaaaaagcac cgtgcatgcc .11520 
gattggtgga agtaaggtgg tacgatcgtg ccttattagg aaggcaacag acgggtctga 11580 
catggattgg acgaaccact liooo 



<210> 43 
<23.1> 35211 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Plasmid AvlnBg 



<400> 43 . . cn 

catcatcaat aatatacctt attttggatt gaagccaata tgataatgag ggggtggagt 60 
ttgtgacgtg gcgcggggcg tgggaacggg gcgggtgacg tagtagtgtg gcggaagtgt 120 
gatgttgcaa gtgtggcgga acacatgtaa gcgacggatg tggcaaaagt gacgtttttg 180 
gtgtgcgccg gtgtacacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag 240 
taaatttggg cgtaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga 300 
agtqaaatct gaataatttt gtgttactca tagcgcgtaa tatttgtcta gggccgcggg 360 
gactttgacc gtttacgtgg agactcgccc aggtgttttt ctcaggtgtt ttccgcgttc 420 
cqqqtcaaag ttggcgtttt attattatag tcagtacgta ccagtgcact ggcctaggaa 480 
g 2t?ggtacc ggtgaattcg ctagcgttcg cgccccgatg tacgggccag atatacgcgt 540 
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atctgagggg actagggtgt gtttaggcga 
agtcccctca ggatatagta gtttcgcttt 
caatactctt gtagtcttgc aacatggtaa 
agaaaaagca ccgtgcatgc cgattggtgg 
gaaggcaaca gacgggtctg acatggattg 
tattgtattt aagtgcctag ctcgatacaa 
tgtgcacctc cggccctggc cactctcttc 
ggctcgcggt tgaggacaaa ctcttcgcgg 
tcggcctccg aacggtactc cgccgccgag 
ggaaaacctc tcgagaaagg cgtgtaacca 
ccgggctgca ggaattcgat ctagatggat 
gaccccaagg actttccttc agaattgcta 
ttacaacgtc gtgactggga aaaccctggc 
ccccctttcg ccagctggcg taatagcgaa 
ttgcgcagcc tgaatggcga atggcgcttt 
gaaagctggc tggagtgcga tcttcctgag 
cagatgcacg gttacgatgc gcccatctac 
ccgccgtttg ttcccacgga gaatccgacg 
gaaagctggc tacaggaagg ccagacgcga 
catctgtggt gcaacgggcg ctgggtcggt 
tttgacctga gcgcattttt acgcgccgga 
tggagtgacg gcagttatct ggaagatcag 
gacgtctcgt tgctgcataa accgactaca 
tttaatgatg atttcagccg cgctgtactg 
cgtgactacc tacgggtaac agtttcttta 
accgcgcctt tcggcggtga aattatcgat 
ctacgtctga acgtcgaaaa cccgaaactg 
gcggtggttg aactgcacac cgccgacggc 
ggtttccgcg aggtgcggat tgaaaatggt 
attcgaggcg ttaaccgtca cgagcatcat 
acgatggtgc aggatatcct gctgatgaag 
cattatccga accatccgct gtggtacacg 
gatgaagcca atattgaaac ccacggcatg 
cgctggctac cggcgatgag cgaacgcgta 
ccgagtgtga tcatctggtc gctggggaat 
ctgtatcgct ggatcaaatc tgtcgatcct 
gccgacacca cggccaccga tattatttgc 
cccttcccgg ctgtgccgaa atggtccatc 
cgcccgctga tcctttgcga atacgcccac 
aaatactggc aggcgtttcg tcagtatccc 
gtggatcagt cgctgattaa atatgatgaa 
gattttggcg atacgccgaa cgatcgccag 
cgcacgccgc atccagcgct gacggaagca 
ttatccgggc aaaccatcga agtgaccagc 
ctcctgcact ggatggtggc gctggatggt 
gatgtcgctc cacaaggtaa acagttgatt 
gccgggcaac tctggctcac agtacgcgta 
gccgggcaca tcagcgcctg gcagcagtgg 
cccgccgcgt cccacgccat cccgcatctg 
ctgggtaata agcgttggca atttaaccgc 
ggcgataaaa aacaactgct gacgccgctg 
aacgacattg gcgtaagtga agcgacccgc 
aaggcggcgg gccattacca ggccgaagca 
gctgafcgcgg tgctgattac gaccgctcac 
atcagccgga aaacctaccg gattgatggt 
gaagtggcga gcgatacacc gcatccggcg 
gtagcagagc gggtaaactg gctcggatta 
actgccgcct gttttgaccg ctgggatctg 
ttcccgagcg aaaacggtct gcgctgcggg 
tggcgcggcg acttccagtt caacatcagc 
agccatcgcc atctgctgca cgcggaagaa 



aaagcggggc ttcggttgta cgcggttagg 600 
tgcataggga gggggaaatg tagtcttatg 660 
cgatgagtta gcaacatgcc ttacaaggag 720 
aagtaaggtg gtacgatcgt gccttattag 780 
gacgaaccac tgaattccgc attgcagaga 840 
taaacgccat ttgaccattc accacattgg 900 
cgcatcgctg tctgcggggg ccagctgttg 960 
tctttccagt actcttggat cggaaacccg 1020 
ggacctgagc gagtccgcat cgaccggatc 1080 
gtcacagtcg ctctagaact agtggatccc 1140 
aaaggtccaa aaaagaagag aaaggtagaa 1200 
agttttttga gtgattcact ggccgtcgtt 1260 
gttacccaac ttaatcgcct tgcagcacat 1320 
gaggcccgca ccgatcgccc ttcccaacag 1380 
gcctggtttc cggcaccaga agcggtgccg 1440 
gccgatactg tcgtcgtccc ctcaaactgg 1500 
accaacgtaa cctatcccat tacggtcaat 1560 
ggttgttact cgctcacatt taatgttgat 1620 
attatttttg atggcgttaa ctcggcgttt 1680 
tacggccagg acagtcgttt gccgtctgaa 1740 
gaaaaccgcc tcgcggtgat ggtgctgcgt 1800 
gatatgtggc ggatgagcgg cattttccgt 1860 
caaatcagcg atttccatgt tgccactcgc 1920 
gaggctgaag ttcagatgtg cggcgagttg 1980 . 
tggcagggtg aaacgcaggt cgccagcggc 2040 
gagcgtggtg gttatgccga tcgcgtcaca 2100 
tggagcgccg aaatcccgaa tctctatcgt 2160 
acgctgattg aagcagaagc ctgcgatgtc 2220 
ctgctgctgc tgaacggcaa gccgttgctg 2280 
cctctgcatg gtcaggtcat ggatgagcag 2340 
cagaacaact ttaacgccgt gcgctgttcg 2400 
ctgtgcgacc gctacggcct gtatgtggtg 2460 
gtgccaatga atcgtctgac cgatgatccg 2520 
acgcgaatgg tgcagcgcga tcgtaatcac 2580 
gaatcaggcc acggcgctaa tcacgacgcg 2640 
tcccgcccgg tgcagtatga aggcggcgga 2700 
ccgatgtacg cgcgcgtgga tgaagaccag 2760 
aaaaaatggc tttcgctacc tggagagacg 2 820 
gcgatgggta acagtcttgg cggtttcgct 2 880 
cgtttacagg gcggcttcgt ctgggactgg 2940 
aacggcaacc cgtggtcggc ttacggcggt 3000 
ttctgtatga acggtctggt ctttgccgac 3060 
aaacaccagc agcagttttt ccagttccgt 3120 
gaatacctgt tccgtcatag cgataacgag 3180 
aagccgctgg caagcggtga agtgcctctg 3240 
gaactgcctg aactaccgca gccggagagc 3300 
gtgcaaccga acgcgaccgc atggtcagaa 3360 
cgtctggcgg aaaacctcag tgtgacgctc 3420 
accaccagcg aaatggattt ttgcatcgag 3480 
cagtcaggct ttctttcaca gatgtggatt 3540 
cgcgatcagt tcacccgtgc accgctggat 3600 
attgacccta acgcctgggt cgaacgctgg 3660 
gcgttgttgc agtgcacggc agatacacfct 3720 
gcgtggcagc atcaggggaa aaccttattt 3780 
agtggtcaaa tggcgattac cgttgafcgtt 3 840 
cggattggcc tgaactgcca gctggcgcag 3900 
gggccgcaag aaaactatcc cgaccgcctt 3960 
ccattgtcag acatgtatac cccgtacgtc 4020 
acgcgcgaat tgaattatgg cccacaccag 4 080 
cgctacagtc aacagcaact gatggaaacc 4140 
ggcacatggc tgaatatcga cggtttccat 4200 
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ataoaaattq qtggcgacga ctcctggagc ccgtcagtat cggcggaatt tcagctgagc 4260 
gcclfcgct SSttacla gttggtltgg tgtcaaaaat aataatctcg aatcaagctt 4320 
f tcaataccq tcgaaacttg tttattgcag cttataatgg ttacaaataa agcaatagca 4380 
tcacaaattt cHaaataai gcattttttt cactgcattc tagttgtggt ttgtccaaac 4440 
^catcaatgt Scttatcat gtctggatcc gacctcggat ctggaaggtg ctgaggtacg 4500 
ataagacclg Xlcaggtgc Igaccdtgcg agtgtggcgg taaacatatt aggaaccagc 4560 
Sqtqatgcf gStgEialc gaggagctga ggcccgatca cttggtgctg gcctgcaccc 4620 
gcSgagS tggctctagc gatgaagata cagattgagg tactgaaatg tgtgggcgtg 4680 
gcftalgggt gggaaagaat Itataaggtg ggggtcttat gtagttttgt atctgttttg 4740 
clacagcclc Iglcgccatg agcaccaact cgtttgatgg aagcattgtg agctcatatt 4800 
tgacalcgfg cftglccccl tgggccgggg tgcgtcagaa tgtgatgggc tccagcattg 4860 
ataatcaccc cgtcctgccc gcaaactcta ctaccttgac ctacgagacc gtgtctggaa 4920 
SlSttS gactgcagcc tccgccgccg cttcagccgc tgcagccacc gcccgcggga 4980 
ttrtlactla otttgctttc ctgagcccgc ttgcaagcag tgcagcttcc cgttcatccg 5040 
cccglgatfa caagttgacg gctcttttgg cacaattgga ttctttgacc cgggaactta 5100 
atSSttte tcagcagctg ttggatctgc gccagcaggt ttctgccctg aaggcttcct 5160 
cccctlccat tgcjgtltai aacataaata aaaaaccaga ctctgtttgg atttggatca 5220 
aqcaagtgtc ttgltgtctt tatttagggg ttttgcgcgc gcggtaggcc cgggaccagc 5280 
Slhrtcaotc qttgagggtc ctgtgtattt tttccaggac gtggtaaagg tgactctgga 5340 
tSSSStl latgg|cata aglccgtctc tggggtggag gtagcaccac tgcagagctt 5400 
cSqctlcgg ggtHtgttg tlgatgatcc agtcgtagca ggagcgctgg gcgtggtgcc 5460 
taalaatqtc tltllgtagc aagctgattg ccaggggcag gcccttggtg taagtgttta 5520 
cataqcSft aagctggglt gggtgcatac gtggggatat gagatgcatc ttggactgta 5580 
SKtSgftt ggctatgttc ccagccatat ccctccgggg attcatgttg tgcagaacca 5640 
ccaaclcagt glatccggtg cacttgggaa atttgtcatg tagcttagaa ggaaatgcgt 5700 
aaaSIctt ggagacglcl ttgtgacctc caagattttc catgcattcg tccataatga 5760 
?!gofatggg ccclcgggcg gclglctggg cgaagatatt tctgggatca ctaacgtcat 5820 
aattqtqttc aggatgagat cgtcataggc catttttaca aagcgcgggc ggagggtgcc 5880 
aqaclgcggt atLtggtte catccggccc aggggcgtag ttaccctcac agatttgcat 5940 
ttccclcgft ttgagttcag atggggggat catgtctacc tgcggggcga tgaagaaaac 6000 
ggtttccfgg gtfggggaga tcagctggga agaaagcagg ttcctgagca gctgcgactt 6060 
Iccacagccq gtgggcccgt aaatcacacc tattaccggg tgcaactggt agttaagaga 6120 
gctfcagctg Icg?catclc tgagcagggg ggccacttcg ttaagcatgt ccctgactcg 6180 
latattttcc ctgaccaaat ccgccagaag gcgctcgccg cccagcgata gcagttcttg 6240 
laaSagca aagtttttca acggtttgal accgtccgcc gtaggcatgc ttttgagcgt 6300 
ttSccalqc agttccaggc ggtcccacag ctcggtcacc tgctctacgg catctcgatc 6360 
cagcalatlt ScgttSg cgggttgggg cggctttcgc tgtacggcag tagtcggtgc 6420 
tcotccaqac gggccagggt catgtctttc cacgggcgca gggtcctcgt cagcgtagtc b«u 
^SScacgg fgaagggglg cgctccgggc tgcgcgctgg ccagggtgcg cttgaggctg 6540 
qtcSgctS tlctilllcg ctgccggtct tcgccotgcg cgtcggccag gtagcatttg 6600 
accatSft cltagtccag cccctccgcg gcgtggccct tggcgcgcag cttgcccttg 6660 
alqqaqqcic cgcalgaggg gcagtgcaga cttttgaggg cgtagagctt gggcgcgaga 6720 
laf Icgaft ccggggagfa ggcltlcgcg ccgcaggccc cgcagacggt ctcgcattcc 6780 
acaaqccaqg tgagctctgg ccgttcgggg tcaaaaacca ggtttccccc atgctttttg b»*u 
atKtttl? tlcltctgft ttccatgagc cggtgtccac Sctcggtgac Egmag* 6900 
tccatcttccc cgtatacaga cttgagaggc ctgtcctcga gcggtgttcc gcggtcctcc bsbu 
tcgtSgaa altcggacla ctclglgaca aaggctcgcg tccaggccag cacgaaggag 7020 
actaaqtqS aggggtagcg gtcgttgtcc actagggggt ccactcgctc cagggtgtga 7080 
aqacaiafU cgclltcttl ggcitcaagg aaggtgattg gtttgtaggt gtaggccacg 7140 
tlaccgggtg t?cctgaagg igggctataa aagggggtgg gggcgcgttc gtcctcactc 7200 
tcttcllcat cgctglctgc gagggccagc tgttggggtg agtactccct ctgaaaagcg 7260 
qqcatglctt clgcgctaag Sttgtcagtt tccaaaaacg aggaggattt gatattcacc 7320 
?aqcccqcqq tgltgccttt gagggtggcc gcatccatct ggtcagaaaa gacaatcttt 7380 
£lgttgtcal gcttlgtggc lalllacccg tagagggcgt tggacagcaa cttggcgatg 7440 
galcglaggg tttggttttt gtcgcgatcg gcgcgctcct tggccgcgat gtttagctgc 7500 
Icltlttlp gcgllacgca ccgccattcg ggaaagacgg tggtgcgctc gtcgggcacc 7560 
aqgtgcacgc Iclaaccgcg gttgtgcagg gtgacaaggt caacgctggt ga«t a ^tct 7620 
cclcltagic |ctcgttggt ccagcagagg cggccgccct tgcgcgagca g^atggcggt 7680 
Sgggtlta ictgcgtctc gtccgggggg tctgcgtcca cggtaaagac cccgggcagc 7740 
aqqcqlqcqt Igaagtagtc tatcttgcat ccttgcaagt ctagcgcctg ctgccatgcg 7800 
cgglggcla glgcgcgftc gtatgggttg agtgggggac cccatggcat ggggtgggtg 7860 
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agcgcggagg cgtacatgcc gcaaatgtcg taaacgtaga ggggctctct gagtattcca 7920 
agatatgtag ggtagcatct tccaccgcgg atgctggcgc gcacgtaatc gtatagttcg 7980 
tgcgagggag cgaggaggtc gggaccgagg ttgctacggg cgggctgctc tgctcggaag 8040 
actatctgcc tgaagatggc atgtgagttg gatgatatgg ttggacgctg gaagacgttg 8100 
aagctggcgt ctgtgagacc taccgcgtca cgcacgaagg aggcgtagga gtcgcgcagc 8160 
ttgttgacca gctcggcggt gacctgcacg tctagggcgc agtagtccag ggtttccttg 8220 
atgatgtcat acttatcctg tccctttttt ttccacagct cgcggttgag gacaaactct 8280 
tcgcggtctt tccagtactc ttggatcgga aacccgtcgg cctccgaacg gtaagagcct 8340 
agcatgtaga actggttgac ggcctggtag gcgcagcatc ccttttctac gggtagcgcg 8400 
tatgcctgcg cggccttccg gagcgaggtg tgggtgagcg caaaggtgtc cctgaccatg 8460 
actttgaggt actggtattt gaagtcagtg tcgtcgcatc cgccctgctc ccagagcaaa 8520 
aagtccgtgc gctttttgga acgcggattt ggcagggcga aggtgacatc gttgaagagt 8580 
atctttcccg cgcgaggcat aaagttgcgt gtgatgcgga agggtcccgg cacctcggaa 8640 
cggttgttaa ttacctgggc ggcgagcacg atctcgtcaa agccgttgat gttgtggccc 8700 
acaatgtaaa gttccaagaa gcgcgggatg cccttgatgg aaggcaattt tttaagttcc 8760 
tcgtaggtga gctcttcagg ggagctgagc ccgtgctctg aaagggccca gtctgcaaga 8820 
tgagggttgg aagcgacgaa tgagctccac aggtcacggg ccattagcat ttgcaggtgg 8880 
tcgcgaaagg tcctaaactg gcgacctatg gccatttttt ctggggtgat gcagtagaag 8940 
gtaagcgggt cttgttccca gcggtcccat ccaaggttcg cggctaggtc tcgcgcggca 9000 
gtcactagag gctcatctcc gccgaacttc atgaccagca tgaagggcac gagctgcttc 9060 
ccaaaggccc ccatccaagt ataggtctct acatcgtagg tgacaaagag acgctcggtg 9120 
cgaggatgcg agccgatcgg gaagaactgg atctcccgcc accaattgga ggagtggcta 9180 
ttgatgtggt gaaagtagaa gtccctgcga cgggccgaac actcgtgctg gcttttgtaa 9240 
aaacgtgcgc agtactggca gcggtgcacg ggctgtacat cctgcacgag gttgacctga 9300 
cgaccgcgca caaggaagca gagtgggaat ttgagcccct cgcctggcgg gtttggctgg 9360 
tggtcttcta cttcggctgc ttgtccttga ccgtctggct gctcgagggg agttacggtg 9420 
gatcggacca ccacgccgcg cgagcccaaa gtccagatgt ccgcgcgcgg cggtcggagc 94 80 
ttgatgacaa catcgcgcag atgggagctg tccatggtct ggagctcccg cggcgtcagg 9540 
tcaggcggga gctcctgcag gtttacctcg catagacggg tcagggcgcg ggctagatcc 9600 
aggtgatacc taatttccag gggctggttg gtggcggcgt cgatggcttg caagaggccg 9660 
catccccgcg gcgcgactac ggtaccgcgc ggcgggcggt gggccgcggg ggtgtccttg 9720 
gatgatgcat ctaaaagcgg tgacgcgggc gagcccccgg aggtaggggg ggctccggac 9780 
ccgccgggag agggggcagg ggcacgtcgg cgccgcgcgc gggcaggagc tggtgctgcg 9840 
cgcgtaggtt gctggcgaac gcgacgacgc ggcggttgat ctcctgaatc tggcgcctct 9900 
gcgtgaagac gacgggcccg gtgagcttga gcctgaaaga gagttcgaca gaatcaattt 9960 
cggtgtcgtt gacggcggcc tggcgcaaaa tctcctgcac gtctcctgag ttgtcttgat 10020 
aggcgatctc ggccatgaac tgctcgatct cttcctcctg gagatctccg cgtccggctc 10080 
gctccacggt ggcggcgagg tcgttggaaa tgcgggccat gagctgcgag aaggcgttga 10140 
ggcctccctc gttccagacg cggctgtaga ccacgccccc ttcggcatcg cgggcgcgca 10200 
tgaccacctg cgcgagattg agctccacgt gccgggcgaa gacggcgtag tttcgcaggc 10260 
gctgaaagag gtagttgagg gtggtggcgg tgtgttctgc cacgaagaag tacataaccc 10320 
agcgtcgcaa cgtggattcg ttgatatccc ccaaggcctc aaggcgctcc atggcctcgt 10380 
agaagtccac ggcgaagttg aaaaactggg agttgcgcgc cgacacggtt aactcctcct 10440 
ccagaagacg gatgagctcg gcgacagtgt cgcgcacctc gcgctcaaag gctacagggg 10500 
cctcttcttc ttcttcaatc tcctcttcca taagggcctc cccttcttct tcttctggcg 10560 
gcggtggggg aggggggaca cggcggcgac gacggcgcac cgggaggcgg tcgacaaagc 10620 
gctcgatcat ctccccgcgg cgacggcgca tggtctcggt gacggcgcgg ccgttctcgc 10680 
gggggcgcag ttggaagacg ccgcccgtca tgtcccggtt atgggttggc ggggggctgc 10740 
catgcggcag ggatacggcg ctaacgatgc atctcaacaa ttgttgtgta ggtactccgc 10800 
cgccgaggga cctgagcgag tccgcatcga ccggatcgga aaacctctcg agaaaggcgt 10860 
ctaaccagtc acagtcgcaa ggtaggctga gcaccgtggc gggcggcagc gggcggcggt 10920 
cggggttgtt tctggcggag gtgctgctga tgatgtaatt aaagtaggcg gtcttgagac 10980 
ggcggatggt cgacagaagc accatgtcct tgggtccggc ctgctgaatg cgcaggcggt 11040 
cggccatgcc ccaggcttcg ttttgacatc ggcgcaggtc tttgtagtag tcttgcatga 11100 
gcctttctac cggcacttct tcttctcctt cctcttgtcc tgcatctctt gcatctatcg 11160 
ctgcggcggc ggcggagttt ggccgtaggt ggcgccctct tcctcccatg cgtgtgaccc 11220 
cgaagcccct catcggctga agcagggcta ggtcggcgac aacgcgctcg gctaatatgg 11280 
cctgctgcac ctgcgtgagg gtagactgga agtcatccat gtccacaaag cggtggtatg 11340 
cgcccgtgtt gatggtgtaa gtgcagttgg ccataacgga ccagttaacg gtctggtgac 11400 
ccggctgcga gagctcggtg tacctgagac gcgagtaagc cctcgagtca aatacgtagt 11460 
cgttgcaagt ccgcaccagg tactggtatc ccaccaaaaa gtgcggcggc ggctggcggt 11520 
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agaggggcca 

gatatccgta 

gaaagtcgcg 

cgctctggcc 

tgtaagcggg 

accggggttc 

gtcgaaccca 

gcgcggcggc 

gctggaaagc 

gttgagtcgc 

ttgcctcccc 

ttttttgctt 

ggcaagagca 

gaggggcgac 

gggcccggca 

ctcctgagcg 

ggcagaacct 

tccacgcagg 

actttgagcc 

acctggtaac 

acaaccacgt 

gggactttgt 

tccttatagt 

tagagcccga 

aggagcgcag 

tgggcaagtt 

taaagatcga 

tgggcgttta 

tcagcgaccg 

atagagaggc 

gcgccctgga 

acgtcggcgg 

actaagcggt 

ggcgctgcag 

ccgcatcatg 

ccggctctcc 

ggtgctggcg 

cctggtctac 

caacctggac 

gcagcagggc 

caacgtgccg 

gactgagaca 

tagacaaggc 

gggggtgcgg 

gcgcctgttg 

atacctaggt 

gcatactttc 

cctggaggca 

cagtttaaac 

cctgatgcgc 

accgggcatg 

cgcggccgcc 

gccccctggt 

ggacgacata 

gcgcgagcag 

cgatctaggc 

gtctcttacc 

caactcgctg 

gatagagagc 

cgtgccaggc 

gtgggaggac 



gcgtagggtg 

gatgtacctg 

gacgcggttc 

ggtcaggcgc 

cactcttccg 

gagccccgta 

ggtgtgcgac 

tgctgcgcta 

gaaagcatta 

gggacccccg 

gtcatgcaag 

ttcccagatg 

agagcagcgg 

atccgcggtt 

ctacctggac 

gtacccaagg 

gtttcgcgac 

gcgcgagctg 

cgacgcgcga 

cgcatacgag 

gcgtacgctt 

aagcgcgctg 

gcagcacagc 

gggccgctgg 

cttgagcctg 

ttacgcccgc 

ggggttctac 

tcgcaacgag 

cgagctgatg 

cgagtcctac 

ggcagctggg 

cgtggaggaa 

gatgtttctg 

agccagccgt 

tcgctgactg 

gcaattctgg 

atcgtaaacg 

gacgcgctgc 

cggctggtgg 

aacctgggct 

cggggacagg 

ccgcaaagtg 

ctgcagaccg 

gctcccacag 

ctgctgctaa 

cacttgctga 

caggagatta 

accctaaact 

agcgaggagg 

gacggggtaa 

tatgcctcaa 

gtgaaccccg 

ttctacaccg 

gacgacagcg 

gcagaggcgg 

gctgcggccc 

agcactcgca 

ctgcagccgc 

ctagtggaca 

ccgcgcccgc 

gatgactcgg 



gccggggctc cgggggcgag atcttccaac ataaggcgat 11580 
gacatccagg tgatgccggc ggcggtggtg gaggcgcgcg 11640 
cagatgttgc gcagcggcaa aaagtgctcc atggtcggga 11700 
gcgcaatcgt tgacgctcta gaccgtgcaa aaggagagcc 11760 
tggtctggtg gataaattcg caagggtatc atggcggacg 11820 
tccggccgtc cgccgtgatc catgcggtta ccgcccgcgt 11880 
gtcagacaac gggggagtgc tccttttggc ttccttccag 11940 
gcttttttgg ccactggccg cgcgcagcgt aagcggttag 12000 
agtggctcgc tccctgtagc cggagggtta ttttccaagg 12060 
gttcgagtct cggaccggcc ggactgcggc gaacgggggt 12120 
accccgcttg caaattcctc cggaaacagg gacgagcccc 12180 
catccggtgc tgcggcagat gcgcccccct cctcagcagc 12240 
cagacatgca gggcaccctc ccctcctcct accgcgtcag 12300 
gacgcggcag cagatggtga ttacgaaccc ccgcggcgcc 12360 
ttggaggagg gcgagggcct ggcgcggcta ggagcgccct 12420 
gtgcagctga agcgtgatac gcgtgaggcg tacgtgccgc 12480 
cgcgagggag aggagcccga ggagatgcgg gatcgaaagt 12540 
cggcatggcc tgaatcgcga gcggttgctg cgcgaggagg 12600 
accgggatta gtcccgcgcg cgcacacgtg gcggccgccg 12660 
cagacggtga accaggagat taactttcaa aaaagcttta 12720 
gtggcgcgcg aggaggtggc tataggactg atgcatctgt 12780 
gagcaaaacc caaatagcaa gccgctcatg gcgcagctgt 12840 
agggacaacg aggcattcag ggatgcgctg ctaaacatag 12900 
ctgctcgatt tgataaacat cctgcagagc atagtggtgc 12960 
gctgacaagg tggccgccat caactattcc atgcfctagcc 13020 
aagatatacc atacccctta cgttcccata gacaaggagg 13080 
atgcgcatgg cgctgaaggt gcttaccttg agcgacgacc 13140 
cgcatccaca aggccgtgag cgtgagccgg cggcgcgagc 13200 
cacagcctgc aaagggccct ggctggcacg ggcagcggcg 13260 
tttgacgcgg gcgctgacct gcgctgggcc ccaagccgac 13320 
gccggacctg ggctggcggt ggcacccgcg cgcgctggca 13380 
tatgacgagg acgatgagta cgagccagag gacggcgagt 13440 
atcagatgat gcaagacgca acggacccgg cggtgcgggc 13500 
ccggccttaa ctccacggac gactggcgcc aggtcatgga 13560 
cgcgcaatcc tgacgcgttc cggcagcagc cgcaggccaa 13620 
aagcggtggt cccggcgcgc gcaaacccca cgcacgagaa 13680 
cgctggccga aaacagggcc atccggcccg acgaggccgg 13740 
ttcagcgcgt ggctcgttac aacagcggca acgtgcagac 13800 
gggatgtgcg cgaggccgtg gcgcagcgtg agcgcgcgca 13860 
ccatggttgc actaaacgcc ttcctgagta cacagcccgc 13920 
aggactacac caactttgtg agcgcactgc ggctaatggt 13980 
aggtgtacca gtctgggcca gactattttt tccagaccag 14040 
taaacctgag ccaggctttc aaaaacttgc aggggctgtg 14100 
gcgaccgcgc gaccgtgtct agcttgctga cgcccaactc 14160 
tagcgccctt cacggacagt ggcagcgtgt cccgggacac 14220 
cactgtaccg cgaggccata ggtcaggcgc atgtggacga 14280 
caagtgtcag ccgcgcgctg gggcaggagg acacgggcag 14340 
acctgctgac caaccggcgg cagaagatcc cctcgttgca 14400 
agcgcatttt gcgctacgtg cagcagagcg tgagccttaa 14460 
cgcccagcgt ggcgctggac atgaccgcgc gcaacatgga 14520 
accggccgtt tatcaaccgc ctaatggact acttgcatcg 14580 
agtatttcac caatgccatc ttgaacccgc actggctacc 14640 
ggggattcga ggtgcccgag ggtaacgatg gattcctctg 14700 
tgttttf:cccc gcaaccgcag accctgctag agttgcaaca 14760 
cgctgcgaaa ggaaagcttc cgcaggccaa gcagcttgtc 14820 
cgcggtcaga tgctagtagc ccatttccaa gcttgatagg 14880 
ccacccgccc gcgcctgctg ggcgaggagg agtacctaaa 14940 
agcgcgaaaa aaacctgcct ccggcatttc ccaacaacgg 15000 
agatgagtag atggaagacg tacgcgcagg agcacaggga 15060 
ccacccgtcg tcaaaggcac gaccgtcagc ggggtctggt 15120 
cagacgacag cagcgtcctg gatttgggag ggagtggcaa 15180 
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cccgtttgcg caccttcgcc ccaggctggg gagaatgttt taaaaaaaaa aaagcatgat 15240 
gcaaaataaa aaactcacca aggccatggc accgagcgtt ggttttcttg tattcccctt 15300 
agtatgcggc gcgcggcgat gtatgaggaa ggtcctcctc cctcctacga gagtgtggtg 15360 
agcgcggcgc cagtggcggc ggcgctgggt tctcccttcg atgctcccct ggacccgccg 15420 
tttgtgcctc cgcggtacct gcggcctacc ggggggagaa acagcatccg ttactctgag 15480 
ttggcacccc tattcgacac cacccgtgtg tacctggtgg acaacaagtc aacggatgtg 15540 
gcatccctga actaccagaa cgaccacagc aactttctga ccacggtcat tcaaaacaat 15600 
gactacagcc cgggggaggc aagcacacag accatcaatc ttgacgaccg gtcgcactgg 15660 
ggcggcgacc tgaaaaccat cctgcatacc aacatgccaa atgtgaacga gttcatgttt 15720 
accaataagt ttaaggcgcg ggtgatggtg tcgcgcttgc ctactaagga caatcaggtg 15780 
gagctgaaat acgagtgggt ggagttcacg ctgcccgagg gcaactactc cgagaccatg 15840 
accatagacc ttatgaacaa cgcgatcgtg gagcactact tgaaagtggg cagacagaac 15900 
ggggttctgg aaagcgacat cggggtaaag tttgacaccc gcaacttcag actggggttt 15960 
gaccccgtca ctggtcttgt catgcctggg gtatatacaa acgaagcctt ccatccagac 16020 
atcattttgc tgccaggatg cggggtggac ttcacccaca gccgcctgag caacttgttg 16080 
ggcatccgca agcggcaacc cttccaggag ggctttagga tcacctacga tgatctggag 16140 
ggtggtaaca ttcccgcact gttggatgtg gacgcctacc aggcgagctt gaaagatgac 16200 
accgaacagg gcgggggtgg cgcaggcggc agcaacagca gtggcagcgg cgcggaagag 16260 
aactccaacg cggcagccgc ggcaatgcag ccggtggagg acatgaacga tcatgccatt 16320 
cgcggcgaca cctttgccac acgggctgag gagaagcgcg ctgaggccga agcagcggcc 16380 
gaagctgccg cccccgctgc gcaacccgag gtcgagaagc ctcagaagaa accggtgatc 16440 
aaacccctga cagaggacag caagaaacgc agttacaacc taataagcaa tgacagcacc 16500 
ttcacccagt accgcagctg gtaccttgca tacaactacg gcgaccctca gaccggaatc 16560 
cgctcatgga ccctgctttg cactcctgac gtaacctgcg gctcggagca ggtctactgg 16620 
tcgttgccag acatgatgca agaccccgtg accttccgct ccacgcgcca gatcagcaac 16680 
tttccggtgg tgggcgccga gctgttgccc gtgcactcca agagcttcta caacgaccag 16740 
gccgtctact cccaactcat ccgccagttt acctctctga cccacgtgtt caatcgcttt 16800 
cccgagaacc agattttggc gcgcccgcca gcccccacca tcaccaccgt cagtgaaaac 16860 
gttcctgctc tcacagatca cgggacgcta ccgctgcgca acagcatcgg aggagtccag 16920 
cgagtgacca ttactgacgc cagacgccgc acctgcccct acgtttacaa ggccctgggc 16980 
atagtctcgc cgcgcgtcct atcgagccgc actttttgag caagcatgtc catccttata 17040 
tcgcccagca ataacacagg ctggggcctg cgcttcccaa gcaagatgtt tggcggggcc 17100 
aagaagcgct ccgaccaaca cccagtgcgc gtgcgcgggc actaccgcgc gccctggggc 17160 
gcgcacaaac gcggccgcac tgggcgcacc accgtcgatg acgccatcga cgcggtggtg 17220 
gaggaggcgc gcaactacac gcccacgccg ccaccagtgt ccacagtgga cgcggccatt 17280 
cagaccgtgg tgcgcggagc ccggcgctat gctaaaatga agagacggcg gaggcgcgta 17340 
gcacgtcgcc accgccgccg acccggcact gccgcccaac gcgcggcggc ggccctgctt 17400 
aaccgcgcac gtcgcaccgg ccgacgggcg gccatgcggg ccgctcgaag gctggccgcg 17460 
ggtattgtca ctgtgccccc caggtccagg cgacgagcgg ccgccgcagc agccgcggcc 17520 
attagfcgcta tgactcaggg tcgcaggggc aacgtgtatt gggtgcgcga ctcggttagc 17580 
ggcctgcgcg tgcccgtgcg cacccgcccc ccgcgcaact agattgcaag aaaaaactac 17640 
ttagactcgt actgttgtat gtatccagcg gcggcggcgc gcaacgaagc tatgtccaag 17700 
cgcaaaatca aagaagagat gctccaggtc atcgcgccgg agatctatgg ccccccgaag 17760 
aaggaagagc aggattacaa gccccgaaag ctaaagcggg tcaaaaagaa aaagaaagat 17820 
gatgatgatg aacttgacga cgaggtggaa ctgctgcacg ctaccgcgcc caggcgacgg 17880 
gtacagtgga aaggtcgacg cgtaaaacgt gttttgcgac ccggcaccac cgtagtcttt 17940 
acgcccggtg agcgctccac ccgcacctac aagcgcgtgt atgatgaggt gtacggcgac 18000 
gaggacctgc ttgagcaggc caacgagcgc ctcggggagt ttgcctacgg aaagcggcat 18060 
aaggacatgc tggcgttgcc gctggacgag ggcaacccaa cacctagcct aaagcccgta 18120 
acactgcagc aggtgctgcc cgcgcttgca ccgtccgaag aaaagcgcgg cctaaagcgc 18180 
gagtctggtg acttggcacc caccgtgcag ctgatggtac ccaagcgcca gcgactggaa 18240 
gatgtcttgg aaaaaatgac cgtggaacct gggctggagc ccgaggtccg cgtgcggcca 18300 
atcaagcagg tggcgccggg actgggcgtg cagaccgtgg acgttcagat acccactacc 18360 
agtagcacca gtattgccac cgccacagag ggcatggaga cacaaacgtc cccggttgcc 18420 
tcagcggtgg cggatgccgc ggtgcaggcg gtcgctgcgg ccgcgtccaa gacctctacg 18480 
gaggtgcaaa cggacccgtg gatgtttcgc gtttcagccc cccggcgccc gcgcggttcg 18540 
aggaagtacg gcgccgccag cgcgctactg cccgaatatg ccctacatcc ttccattgcg 18600 
cctacccccg gctatcgtgg ctacacctac cgccccagaa gacgagcaac tacccgacgc 18660 
cgaaccacca ctggaacccg ccgccgccgt cgccgtcgcc agcccgtgct ggccccgatt 18720 
tccgtgcgca gggtggctcg cgaaggaggc aggaccctgg tgctgccaac agcgcgctac 18780 
caccccagca tcgtttaaaa gccggtcttt gtggttcttg cagatatggc cctcacctgc 18840 
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cgcctccgtt tcccggtgcc gggattccga ggaagaatgc accgtaggag gggcatggcc 18900 
ggccacggcc tgacgggcgg catgcgtcgt gcgcaccacc ggcggcggcg cgcgtcgcac 18960 
cltcgcatgc gcggcggtat cctgcccctc cttattccac tgatcgccgc ggcgattggc 19020 
gccgtgcccg gaattgcatc cgtggccttg caggcgcaga gacactgatt aaaaacaagt 19080 
tgcatgtgga aaaatcaaaa taaaaagtct ggactctcac gctcgcttgg tcctgtaact 19140 
attttgtaga atggaagaca tcaactttgc gtctctggcc ccgcgacacg gctcgcgccc 19200 
gttcatggga aactggcaag atatcggcac cagcaatatg agcggtggcg ccttcagctg 19260 
gggctcgctg tggagcggca ttaaaaattt cggttccacc gttaagaact atggcagcaa 19320 
ggcctggaac agcagcacag gccagatgct gagggataag ttgaaagagc aaaatttcca 19380 
Icaaaaggtg gtagatggcc tggcctctgg cattagcggg gtggtggacc tggccaacca 19440 
ggcagtgcaa aataagatta acagtaagct tgatccccgc cctcccgtag aggagcctcc 19500 
accggccgtg gagacagtgt ctccagaggg gcgtggcgaa aagcgtccgc gccccgacag 19560 
ggaagaaact ctggtgacgc aaatagacga gcctccctcg tacgaggagg cactaaagca 19620 
aggcctgccc accacccgtc ccatcgcgcc catggctacc ggagtgctgg gccagcacac 19680 
acccgtaacg ctggacctgc ctccccccgc cgacacccag cagaaacctg tgctgccagg 19740 
cccgaccgcc gttgttgtaa cccgtcctag ccgcgcgtcc ctgcgocgcg ccgccagcgg 19800 
tccgcgatcg ttgcggcccg tagccagtgg caactggcaa agcacactga acagcatcgt 19860 
gggtctgggg gtgcaatccc tgaagcgccg acgatgcttc tgaatagcta acgtgtcgta 19920 
tgtgtgtcat gtatgcgtcc atgtcgccgc cagaggagct gctgagccgc cgcgcgcccg 19980 
ctttccaaga tggctacccc ttcgatgatg ccgcagtggt cttacatgca catctcgggc 20040 
caggacgcct cggagtacct gagccccggg ctggtgcagt ttgcccgcgc ^accgagacg 20100 
tacttcagcc tgaataacaa gtttagaaac cccacggtgg cgcctacgca cgacgtgacc 20160 
acagaccggt cccagcgttt gacgctgcgg ttcatccctg tggaccgtga ggatactgcg 20220 
tactcgtaca aggcgcggtt caccctagct gtgggtgata accgtgtgct ggacatggct 20280 
tccacgtact ttgacatccg cggcgtgctg gacaggggcc ctacttttaa gccctactct 20340 
ggcactgcct acaacgccct ggctcccaag ggtgccccaa atccttgcga atgggatgaa 20400 
gctgctactg ctcttgaaat aaacctagaa gaagaggacg atgacaacga agacgaagta 20460 
gacgagcaag ctgagcagca aaaaactcac gtatttgggc aggcgcctta ttctggtata 20520 
aatattacaa aggagggtat tcaaataggt gtcgaaggtc aaacacctaa atatgccgat 20580 
aaaacatttc aacctgaacc tcaaatagga gaatctcagt ggtacgaaac tgaaattaat 20640 
catgcagctg ggagagtcct taaaaagact accccaatga aaccatgtta cggttcatat 20700 
gcaaaaccca caaatgaaaa tggagggcaa ggcattcttg taaagcaaca aaatggaaag 20760 
ctagaaagtc aagtggaaat gcaatttttc tcaactactg aggcgaccgc aggcaatggt 20820 
gataacttga ctcctaaagt ggtattgtac agtgaagatg tagatataga aaccccagac 20880 
actcatattt cttacatgcc cactattaag gaaggtaact cacgagaact aatgggccaa 20940 
caatctatgc ccaacaggcc taattacatt gcttttaggg acaattttat tggtctaatg 21000 
tattacaaca gcacgggtaa tatgggtgtt ctggcgggcc aagcatcgca gttgaatgct 21060 
gttgtagatt tgcaagaoag aaacacagag ctttcatacc agcttttgct tgattccatt 21120 
ggtgatagaa ccaggtactt ttctatgtgg aatcaggctg ttgacagcta tgatccagat 21180 
gttagaatta ttgaaaatca tggaactgaa gatgaacttc caaattactg ctttccactg 21240 
ggaggtgtga ttaatacaga gactcttacc aaggtaaaac ctaaaacagg tcaggaaaat 21300 
ggatgggaaa aagatgctac agaattttca gataaaaatg aaataagagt tggaaataat 21360 
tttglcatgg aaatcaatct aaatgccaac ctgtggagaa atttcctgta ctccaacata 21420 
gcgctgtatt tgcccgacaa gctaaagtac agtccttcca acgtaaaaat ttctgataac 21480 
ccaaacacct acgactacat gaacaagcga gtggtggctc ccgggttagt ggactgctac 21540 
attaaccttg gagcacgctg gtcccttgac tatatggaca acgtcaaccc atttaaccac 21600 
caccgcaatg ctggcctgcg ctaccgctca atgttgctgg gcaatggtcg ctatgtgccc 21660 
ttccacatcc aggtgcctca gaagttcttt gccattaaaa acctccttct cctgccgggc 21720 
tcatacacct acgagtggaa cttcaggaag gatgttaaca tggttctgca gagctcccta 21780 
ggaaatgacc taagggttga cggagccagc attaagtttg atagcatttg cctttacgcc 21840 
accttcttcc ccatggccca caacaccgcc tccacgcttg aggccatgct tagaaacgac 21900 
accaacgacc agtcctttaa cgactatctc tccgccgcca acatgctcta ccctataccc 21960 
gccaacgcta ccaacgtgcc catatccatc ccctcccgca actgggcggc tttccgcggc 22020 
tgggccttca cgcgccttaa gactaaggaa accccatcac tgggctcggg ctacgaccct 22080 
tattacacct actctggctc tataccctac ctagatggaa ccttttacct caaccacacc 22140 
tttaagaagg tggccattac ctttgactct tctgtcagct ggcctggcaa tgaccgcctg 22200 
cttaccccca acgagtttga aattaagcgc tcagttgacg gggagggtta caacgttgcc 22260 
cagtgtaaca tgaccaaaga ctggttcctg gtacaaatgc tagctaacta caacattggc 22320 
taccagggct tctatatccc agagagctac aaggaccgca tgtactcctt ctttagaaac 22380 
ttccagccca tgagccgtca ggtggtggat gatactaaat acaaggacta ccaacaggtg 22440 
ggcatlctac accaacacaa caactctgga tttgttggct accttgcccc cacoatgcgc 22500 
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gaaggacagg cctaccctgc taacttcccc tatccgctta taggcaagac cgcagttgac 22560 
agcattaccc agaaaaagtt tctttgcgat cgcacccttt ggcgcatccc attctccagt 22620 
aactttatgt ccatgggcgc actcacagac ctgggccaaa accttctcta cgccaactcc 22680 
gcccacgcgc tagacatgac ttttgaggtg gatcccatgg acgagcccac ccttctttat 22740 
gttttgtttg aagtctttga cgtggtccgt gtgcaccggc cgcaccgcgg cgtcatcgaa 22800 
accgtgtacc tgcgcacgcc cttctcggcc ggcaacgcca caacataaag aagcaagcaa 22860 
catcaacaac agctgccgcc atgggctcca gtgagcagga actgaaagcc attgtcaaag 22920 
atcttggttg tgggccatat tttttgggca cctatgacaa gcgctttcca ggctttgttt 22980 
ctccacacaa gctcgcctgc gccatagtca atacggccgg tcgcgagact gggggcgtac 23040 
actggatggc ctttgcctgg aacccgcact caaaaacatg ctacctcttt gagccctttg 23100 
gcttttctga ccagcgactc aagcaggttt accagtttga gtacgagtca ctcctgcgcc 23160 
gtagcgccat tgcttcttcc cccgaccgct gtataacgct ggaaaagtcc acccaaagcg 23220 
tacaggggcc caactcggcc gcctgtggac tattctgctg catgtttctc cacgcctttg 23280 
ccaactggcc ccaaactccc atggatcaca accccaccat gaaccttatt accggggtac 23340 
ccaactccat gctcaacagt ccccaggtac agcccaccct gcgtcgcaac caggaacagc 23400 
tctacagctt cctggagcgc cactcgccct acttccgcag ccacagtgcg cagattagga 23460 
gcgccacttc tttttgtcac ttgaaaaaca tgtaaaaata atgtactaga gacactttca 23520 
ataaaggcaa atgcttttat ttgtacactc tcgggtgatt atttaccccc acccttgccg 23580 
tctgcgccgt ttaaaaatca aaggggttct gccgcgcatc gctatgcgcc actggcaggg 23640 
acacgttgcg atactggtgt ttagtgctcc acttaaactc aggcacaacc atccgcggca 23700 
gctcggtgaa gttttcactc cacaggctgc gcaccatcac caacgcgttt agcaggtcgg 23760 
gcgccgatat cttgaagtcg cagttggggc ctccgccctg cgcgcgcgag ttgcgataca 23820 
cagggttgca gcactggaac actatcagcg ccgggtggtg cacgctggcc agcacgctct 23880 
tgtcggagat cagatccgcg tccaggtcct ccgcgttgct cagggcgaac ggagtcaact 23940 
ttggtagctg ccttcccaaa aagggcgcgt gcccaggctt tgagttgcac tcgcaccgta 24000 
gtggcatcaa aaggtgaccg tgcccggtct gggcgttagg atacagcgcc tgcataaaag 24060 
ccttgatctg cttaaaagcc acctgagcct ttgcgccttc agagaagaac atgccgcaag 24120 
acttgccgga aaactgattg gccggacagg ccgcgtcgtg cacgcagcac cttgcgtcgg 24180 
tgttggagat ctgcaccaca tttcggcccc accggttctt cacgatcttg gccttgctag 24240 
actgctcctt cagcgcgcgc tgcccgtttt cgctcgtcac atccatttca atcacgtgct 24300 
ccttatttat cataatgctt ccgtgtagac acttaagctc gccttcgatc tcagcgcagc 24360 
ggtgcagcca caacgcgcag cccgtgggct cgtgatgctt gtaggtcacc tctgcaaacg 24420 
actgcaggta cgcctgcagg aatcgcccca tcatcgtcac aaaggtcttg ttgctggtga 24480 
aggtcagctg caacccgcgg tgctcctcgt tcagccaggt cttgcatacg gccgccagag 24540 
cttccacttg gtcaggcagt agtttgaagt tcgcctttag atcgttatcc acgtggtact 24600 
tgtccatcag cgcgcgcgca gcctccatgc ccttctccca cgcagacacg atcggcacac 24660 
tcagcgggtt catcaccgta atttcacttt ccgcttcgct gggctcttcc tcttcctctt 24720 
gcgtccgcat accacgcgcc actgggtcgt cttcattcag ccgccgcact gtgcgcttac 24780 
ctcctttgcc atgcttgatt agcaccggtg ggttgctgaa acccaccatt tgtagcgcca 24840 
catcttctct ttcttcctcg ctgtccacga ttacctctgg tgatggcggg cgctcgggct 24 900 
tgggagaagg gcgcttcttt ttcttcttgg gcgcaatggc caaatccgcc gccgaggtcg 24960 
atggccgcgg gctgggtgtg cgcggcacca gcgcgtcttg tgatgagtct tcctcgtcct 25020 
cggactcgat acgccgcctc atccgctttt ttgggggcgc ccggggaggc ggcggcgacg 25080 
gggacgggga cgacacgtcc tccatggttg ggggacgtcg cgccgcaccg cgtccgcgct 25140 
cgggggtggt ttcgcgctgc tcctcttccc gactggccat ttccttctcc tataggcaga 25200 
aaaagatcat ggagtcagtc gagaagaagg acagcctaac cgccccctct gagttcgcca 25260 
ccaecgcctc caccgatgcc gccaacgcgc ctaccacctt ccccgtcgag gcacccccgc 25320 
ttgaggagga ggaagtgatt atcgagcagg acccaggttt tgtaagcgaa gacgacgagg 25380 
accgctcagt accaacagag gataaaaagc aagaccagga caacgcagag gcaaacgagg 25440 
aacaagtcgg gcggggggac gaaaggcatg gcgactacct agatgtggga gacgacgtgc 25500 
tgttgaagca tctgcagcgc cagtgcgcca" ttatctgcga cgcgttgcaa gagcgcagcg 25560 
atgtgcccct cgccatagcg gatgtcagcc ttgcctacga acgccaccta ttctcaccgc 25620 
gcgtaccccc caaacgccaa gaaaacggca catgcgagcc caacccgcgc ctcaacttct 25680 
accccgtatt tgccgtgcca gaggtgcttg ccacctatca catctttttc caaaactgca 25740 
agatacccct atcctgccgt gccaaccgca gccgagcgga caagcagctg gccttgcggc 25800 
agggcgctgt catacctgat atcgcctcgc tcaacgaagt gccaaaaatc tttgagggtc 25860 
ttggacgcga cgagaagcgc gcggcaaacg ctctgcaaca ggaaaacagc gaaaatgaaa 25920 
gtcactctgg agtgttggtg gaactcgagg gtgacaacgc gcgcctagcc gtactaaaac 25980 
gcagcatcga ggtcacccac tttgcctacc cggcacttaa cctacccccc aaggtcatga 26040 
gcacagtcat gagtgagctg atcgtgcgcc gtgcgcagcc cctggagagg gatgcaaatt 26100 
tgcaagaaca aacagaggag ggcctacccg cagttggcga cgagcagcta gcgcgctggc 26160 
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ttcaaacgcg cgagcctgcc gacttggagg agcgacgcaa actaatgatg gccgcagtgc 26220 
tcgttaccgt ggagcttgag tgcatgcagc ggttctttgc tgacccggag atgcagcgca 26280 
agctagagga aacattgcac tacacctttc gacagggcta cgtacgccag gcctgcaaga 26340 
tctccaacgt ggagctctgc aacctggtct cctaccttgg aattttgcac gaaaaccgcc 26400 
ttgggcaaaa cgtgcttcat tccacgctca agggcgaggc gcgccgcgac tacgtccgcg 26460 
actgcgttta cttatttcta tgctacacct ggcagacggc catgggcgtt tggcagcagt 26520 
gcttggagga gtgcaacctc aaggagctgc agaaactgct aaagcaaaac ttgaaggacc 26580 
tatggacggc cttcaacgag cgctccgtgg ccgcgcacct ggcggacatc attttccccg 26640 
aacgcctgct taaaaccctg caacagggtc tgccagactt caccagtcaa agcatgfctgc 26700 
agaactttag gaactttatc ctagagcgct caggaatctt gcccgccacc tgctgtgcac 26760 
ttcctagcga ctttgtgccc attaagtacc gcgaatgccc tccgccgctt tggggccact 26820 
gctaccttct gcagctagcc aactaccttg cctaccactc tgacataatg gaagacgtga 26880 
Qcqgtgacgg tctactggag tgtcactgtc gctgcaacct atgcaccccg caccgctccc 26940 
tggtttgcaa ttcgcagctg cttaacgaaa gtcaaattat cggtaccttt gagctgcagg 27000 
gtccctcgcc tgacgaaaag tccgcggctc cggggttgaa actcactccg gggctgtgga 27060 
cgtcggctta ccttcgcaaa tttgtacctg aggactacca cgcccacgag attaggttct 27120 
acgaagacca atcccgcccg ccaaatgcgg agcttaccgc ctgcgtcatt acccagggcc 27180 
acattcttgg ccaattgcaa gccatcaaca aagcccgcca agagtttctg ctacgaaagg 2724 0 
gacggggggt ttacttggac ccccagtccg gcgaggagct caacccaatc cccccgccgc 273 00 
cgcagcccta tcagcagcag ccgcgggccc ttgcttccca ggatggcacc caaaaagaag 27360 
ctgcagctgc cgccgccacc cacggacgag gaggaatact gggacagtca ggcagaggag 27420 
gttttggacg aggaggagga ggacatgatg gaagactggg agagcctaga cgaggaagct 27480 
tccgaggtcg aagaggtgtc agacgaaaca ccgtcaccct cggtcgcatt cccctcgccg 27540 
gcgccccaga aatcggcaac cggttccagc atggctacaa cctccgctcc tcaggcgccg 27600 
ccggcactgc ccgttcgccg acccaaccgt agatgggaca ccactggaac cagggccggt 27660 
aagtccaagc agccgccgcc gttagcccaa gagcaacaac agcgccaagg ctaccgctca 27720 
tggcgcgggc acaagaacgc catagttgct tgcttgcaag actgtggggg caacatctcc 27780 
ttcgcccgcc gctttcttct ctaccatcac ggcgtggcct tcccccgtaa catcctgcat 27840 
tactaccgtc atctctacag cccatactgc accggcggca gcggcagcgg cagcaacagc 27900 
agcggccaca cagaagcaaa ggcgaccgga tagcaagact ctgacaaagc ccaagaaatc 27960 
cacagcggcg gcagcagcag gaggaggagc gctgcgtctg gcgcccaacg aacccgtatc 28020 
gacccgcgag cttagaaaca ggatttttcc cactctgtat gctatatttc aacagagcag 28080 
gggccaagaa caagagctga aaataaaaaa caggtctctg cgatccctca cccgcagctg 28140 
cctgtatcac aaaagcgaag atcagcttcg gcgcacgctg gaagacgcgg aggctctctt 28200 
cagtaaatac tgcgcgctga ctcttaagga ctagtttcgc gccctttctc aaatttaagc 28260 
gcgaaaacta cgtcatctcc agcggccaca cccggcgcca gcacctgtcg tcagcgccat 28320 
tatgagcaag gaaattccca cgccctacat gtggagttac cagccacaaa tgggacttgc 28380 
ggctggagct gcccaagact actcaacccg aataaactac atgagcgcgg gaccccacat 28440 
gatatcccgg gtcaacggaa tccgcgccca ccgaaaccga attctcttgg aacaggcggc 28500 
tattaccacc acacctcgta ataaccttaa tccccgtagt tggcccgctg ccctggtgta 28560 
ccaggaaagt cccgctccca ccactgtggt acttcccaga gacgcccagg ccgaagttca 28620 
gatgactaac tcaggggcgc agcttgcggg cggctttcgt cacagggtgc ggtcgcccgg 28680 
gcagggtata actcacctga caatcagagg gcgaggtatt cagctcaacg acgagtcggt 28740 
gagctcctcg cttggtctcc gtccggacgg gacatttcag atcggcggcg ccggccgtcc 28800 
ttcattcacg cctcgtcagg caatcctaac tctgcagacc tcgtcctctg agccgcgctc 28860 
tggaggcatt ggaactctgc aatttattga ggagtttgtg ccatcggtct actttaaccc 28920 
cttctcggga cctcccggcc actatccgga tcaatttatt cctaactttg acgcggtaaa 28980 
ggactcggcg gacggctacg actgaatgtt aagtggagag gcagagcaac tgcgcctgaa 29040 
acacctggtc cactgtcgcc gccacaagtg ctttgcccgc gactccggtg agttttgcta 29100 
ctttgaattg cccgaggatc atatcgaggg cccggcgcac ggcgtccggc ttaccgccca 29160 
Qaaagaqctt gcccgtagcc tgattcggga gtttacccag cgccccctgc tagttgagcg 29220 
ggacagggga ccctgtgttc tcactgtgat ttgcaactgt cctaaccttg gattacatca 29280 
agatctttgt tgccatctct gtgctgagta taataaatac agaaattaaa atatactggg 29340 
gctcctatcg ccatcctgta aacgccaccg tcttcacccg cccaagcaaa ccaaggcgaa 29400 
ccttacctgg tacttttaac atctctccct ctgtgattta caacagtttc aacccagacg 29460 
gagtgagtct acgagagaac ctctccgagc tcagctactc catcagaaaa aacaccaccc 29520 
tccttacctg ccgggaacgt acgagtgcgt caccggccgc tgcaccacac ctaccgcctg 29580 
accgtaaacc agactttttc cggacagacc tcaataactc tgtttaccag aacaggaggt 29640 
gagcttagaa aacccttagg gtattaggcc aaaggcgcag ctactgtggg gtttatgaac 29700 
aattcaagca actctacggg ctattctaat tcaggtttct ctagaaatgg acggaattat 29760 
tacagagcag cgcctgctag aaagacgcag ggcagcggcc gagcaacagc gcatgaatca 29820 
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agagctccaa gacatggtta acttgcacca gtgcaaaagg ggtatctttt gtctggtaaa 29880 
gcaggccaaa gtcacctacg acagtaatac caccggacac cgccttagct acaagttgcc 29940 
aaccaagcgt cagaaattgg tggtcatggt gggagaaaag cccattacca taactcagca 30000 
ctcggtagaa accgaaggct gcattcactc accttgtcaa ggacctgagg atctctgcac 30060 
ccttattaag accctgtgcg gtctcaaaga tcttattccc tttaactaat aaaaaaaaat 3 0120 
aataaagcat cacttactta aaatcagtta gcaaatttct gtccagttta ttcagcagca 30180 
cctccttgcc ctcctcccag ctctggtatt gcagcttcct cctggctgca aactttctcc 30240 
acaatctaaa tggaatgtca gtttcctcct gttcctgtcc atccgcaccc actatcttca 30300 
tgttgttgca gatgaagcgc gcaagaccgt ctgaagatac cttcaacccc gtgtatccat 3 0360 
atgacacgga aaccggtcct ccaactgtgc cttttcttac tcctcccttt gtatccccca 30420 
atgggtttca agagagtccc cctggggtac tctctttgcg cctatccgaa cctctagtta 30480 
cctccaatgg catgcttgcg ctcaaaatgg gcaacggcct ctctctggac gaggccggca 3 054 0 
accttacctc ccaaaatgta accactgtga gcccacctct caaaaaaacc aagtcaaaca 30600 
taaacctgga aatatctgca cccctcacag ttacctcaga agccctaact gtggctgccg 3 0660 
ccgcacctct aatggtcgcg ggcaacacac tcaccatgca atcacaggcc ccgctaaccg 3 0720 
tgcacgactc caaacttagc attgccaccc aaggacccct cacagtgtca gaaggaaagc 30780 
tagccctgca aacatcaggc cccct caeca ccaccgatag cagfcaccctt actatcactg 30840 
cctcaccccc tctaactact gccactggta gcttgggcat tgacttgaaa gageccattt 3 0900 
atacacaaaa tggaaaacta ggactaaagt acggggctcc tttgcatgta acagacgacc 30960 
taaacacttt gaeegtagea actggtccag gtgtgactat taataatact tecttgeaaa 31020 
ctaaagttac tggagccttg ggttttgatt cacaaggcaa tatgeaaett aatgtagcag 31080 
gaggactaag gattgattct caaaacagac gecttatact tgatgttagt tatccgtttg 31140 
atgctcaaaa ccaactaaat ctaagactag gacagggccc tctttttata aactcagccc 31200 
acaacttgga tattaactac aacaaaggee tttacttgtt tacagcttca aacaattcca 31260 
aaaagcttga ggttaaccta agcactgcca aggggttgat gtttgacget acagecatag 31320 
ecattaatge aggagatggg cttgaatttg gttcacctaa tgcaccaaac acaaatcccc 31380 
tcaaaacaaa aattggccat ggcctagaat ttgattcaaa caaggctatg gttcctaaac 3144 0 
taggaactgg ccttagtttt gacagcacag gtgecattae agtaggaaac aaaaataatg 31500 
ataagctaac tttgtggacc acaccagctc catctcctaa ctgtagacta aatgeagaga 31560 
) aagatgctaa actcactttg gtcttaacaa aatgtggcag tcaaatactt gctacagttt 31620 
cagttttggc tgttaaaggc agtttggctc caatatctgg aacagttcaa agtgctcatc 31680 
ttattataag atttgacgaa aatggagtgc tactaaacaa ttccttcctg gacccagaat 31740 
attggaactt tagaaatgga gatcttactg aaggcacagc ctatacaaac gctgttggat 31800 
ttatgcctaa cctatcagct tatccaaaat etcaeggtaa aactgecaaa agtaacattg 31860 
tcagtcaagt ttacttaaac ggagacaaaa ctaaacctgt aacactaacc attacactaa 31920 
aeggtacaca ggaaacagga gacacaactc caagtgeata ctctatgtca ttttcatggg 31980 
actggtctgg ccacaactac attaatgaaa tatttgecac atcctcttac actttttcat 32040 
acattgccca agaataaaga atcgtttgtg ttatgtttca acgtgtttat ttttcaattg 32100 
cagaaaattt caagtcattt .ttcattcagt agtatagece caccaccaca tagcttatac 32160 
agatcaccgt accttaatca aactcacaga accctagtat tcaacctgcc acctccctcc 32220 
caacacacag agtacacagt cctttctccc cggctggcct taaaaagcat catatcatgg 32280 
gtaacagaca tattcttagg tgttatattc cacacggttt cctgtcgagc caaacgctca 32340 
tcagtgatat taataaactc cccgggcagc tcacttaagt teatgtcget gtccagctgc 32400 
tgagccacag gctgctgtcc aacttgeggt tgettaaegg geggegaagg agaagtccac 32460 
gectacatgg gggtagagtc ataatcgtgc atcaggatag ggcggtggtg ctgcagcagc 32520 
gcgcgaataa actgctgccg ccgccgctcc gtcctgeagg aatacaacat ggcagtggtc 32580 
tcctcagcga tgattegcac cgcccgcagc ataaggegee ttgtcctccg ggcacagcag 32640 
cgcaccctga tctcacttaa atcagcacag taactgeage acagcaccac aatattgttc 32700 
aaaatcccac agtgcaaggc gctgtatcca aagctcatgg cggggaccac agaacccacg 32760 
tggecatcat accacaagcg caggtagatt aagtggcgac ccctcataaa cacgctggac 32820 
ataaacatta cctcttttgg catgttgtaa ttcaccacct cccggtacca tataaacctc 32880 
tgattaaaca tggcgccatc caccaccatc ctaaaccagc tggecaaaac ctgcccgccg 32940 
gctatacact gcagggaacc gggactggaa caatgacagt ggagagecca ggactcgtaa 33000 
ccatggatca teatgetegt catgatatca atgttggcac aacacaggca caegtgeata 33060 
cacttcctca ggattacaag ctcctcccgc gfctagaacca tatcccaggg aacaacccat 33120 
tcctgaatca gegtaaatec cacactgcag ggaagacctc geaegtaact cacgttgtgc 33180 
attgtcaaag tgttacattc gggcagcagc ggatgatcct ccagtatggt agegegggtt 33240 
tctgtctcaa aaggaggtag acgatcccta ctgtacggag tgegecgaga caaccgagat 33300 
cgtgttggtc gtagtgtcat gccaaatgga acgccggacg tagtcatatt tcctgaagca 33360 
aaaccaggtg egggegtgae aaacagatct gcgtctccgg tctcgccgct tagatcgetc 33420 
tgtgtagtag ttgtagtata tccactctct caaagcatcc aggcgccccc tggcttcggg 33480 
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ttetatgtaa actccttcat gcgccgctgc cctgataaca tccaccaccg cagaataagc 33540 
cacacccagc caacctacac attcgttctg cgagtcacac acgggaggag cgggaagagc 33600 
^aaaSalc atgttttttt ttttattcca aaagattatc caaaacctca aaatgaagat 33660 
cllttlagtg Scgcgctcc cctccggtgg cgtggtcaaa ctctacagcc aaagaacaga 33720 
fcaataacatt tgtiagatgt tgcacaatgg cttccaaaag gcaaacggcc ctcacgtcca 33780 
agtgglcgla aaggctaaac clttcagggt gaatctcctc tataaacatt ccagcaoctt 33840 
clallaticc calataattc tcatctogcc accttctcaa tatatctcta agcaaatccc 33900 
Slatattlag tccggccatt gtaaaaatct gctccagagc gccctccacc ttcagcctca 33960 
agSgcgaa? catglttgca laaattcagg ttcctcacag acctgtataa 9attcaaaag 34020 
tggScitta acalaaatac cgcgatcccg taggtccctt cgcagggcca S^gaacata 34080 
atlgtgcagg tctgcacgga ccagcgcggc cacttccccg ccaggaacct tgacaaaaga 34140 
acSalactl attltgacac goatactcgg agctatgcta accagcgtag ccccgatgta 34200 
lactttgttl catgggcggc gatataaaat gcaaggtgct gctcaaaaaa tcaggcaaag 34260 
cctcgclcal aaaf|aalgc Icatcgtagt catgctoatg cagataaagg ^aggtaagct 34320 
ccoaaaccac cacagaaaaa gacaccattt ttctctcaaa catgtctgcg ggtttctgca 343 80 
tlllclcaal ataafataac laaaaaacat ttaaacatta gaagcctgtc ttacaacagg 34440 
aaalacatS ctlataagca taagacggac tacggccatg ccggcgtgac cgtaaaaaaa 34500 
ItSotctccg tgattaaLa gcaccaccga cagctcctcg gtcatgtccg gagtcataat 34560 
gSlgactcI gtaaacacat caggttgatt catcggtcag tgctaaaaag cgaccgaaat 34620 
Iqcccggggg iatacatacc cgcaggcgta gagacaacat tacagccccc ataggaggta 34680 
ff^afaftt aataacraqaq aaaaacacat aaacacctga aaaaccctcc tgcctaggca 34740 
attttgctcc ttccSctcI tgaacaacat acagcgcttc cacagcggca gccataacag 34800 
tcaSttac cagtalaaaa gaaaacctat taaaaaaaca ccactcgaca cggcaccagc 34860 
tctltcagtc aclgtgtaaa aaagggccaa gtgcagagcg agtatatata ggactaaaaa 34920 
ataacataac ggttaaagtc cacaaaaaac acccagaaaa ccgcacgcga acctacgccc 34980 
agSIIgata gfcaaaalac ccacaacttc ctcaaatcgt cacttccgtt ttcccacgtt 35040 
Icatcacttc Icattttaat taagaaaact acaattccca acacatacaa gttactccgc 35100 
Iclaaaacct acgtcacccg ccccgttccc acgccccgcg ccacgtcaca aactccaccc 35160 
cctcattatc atattggctt caatccaaaa taaggtatat tattgatgat g 



<210> 44 
<211> 33622 
<2X2> DNA 

<213> Artificial Sequence 
<220> 

<223> Plasraid Av3nBg 
<400> 44 



catcatcaat aatatacctt attttggatt gaagccaata tgataatgag ggggtggagt 60 
ttg^gacgtg gcgcggggcg tgggaacggg gcgggtgacg tagtagtgtg ^ggaagtgt 120 
qatgltgcaa gtgtggcgga acacatgtaa gcgacggatg tggcaaaagt gacgtttttg 180 
ItgScgccg gtgtllacag gaagtgacaa ttttcgcgcg gttttaggcg gatgttgtag 240 
taaatttggg Iglaaccgag taagatttgg ccattttcgc gggaaaactg aataagagga 300 
agtgaaatct gfataatttl gtgltactca tagcgcgtaa tatttgtcta SS<3^*& |« 
Sctttgacc gtttacgtgg agactcgccc agggcgcgcc ccgatgtacg ^^abat 420 
acgcgtatct gaggggacta gggtgtgttt aggcgaaaag cggggcttcg gttftacgcg 480 
attalaaatc ccctcaggat atagtagttt cgcttttgca tagggagggg gaaatgtagt 540 
IttafSIat actcttltag tctlgclaca tggtaacgat gagttagcaa catgccttac 600 
laggafagaa allgcaccgt gcatfccgat tggtggaagt aaggtggtac gatcgtgcct 660 
taltaggaag gcalcagacg ggtctgacat ggattggacg aaccactgaa ttccgcattg 720 
caaaaatatt qtatttaagt gcctagctcg atacaataaa cgccatttga ccattcacca vbo 
ct?taStotq facet ccggc Ictggccact ctcttccgca tcgctgtctg cgggggccag 840 
ctgttlggct cgcgattgag gacllactct tcgcggtctt tccagtactc ttggatcgga 900 
Sl-cotS lltclgalcg gtactccgcc gccgagggac ctgagcgagt ccgcatcgac 960 
cggatcggll alcctctcgl gaaaggcgtg taaccagtca cagtcgctct agaactagtg 1020 
ratcccccaa actacaggaa ttcgatctag atggataaag gtccaaaaaa gaagagaaag 1080 
HaQalgacI Icalgglltt tccttcagai ttgctaagtt ttttgagtga ttcactggcc 1140 
gestae aatgffgtga ctgggaaaac cctggcgtta cccaacttaa tcgccttgca 1200 
acacatcccc ctttcgccag ctggcgtaat agegaagagg cccgcaccga tcgcccttcc l^bu 
lllcagttgc gcagcltgal tggcgaatgg cgctttgcct ggtttccggc accagaagcg 1320 
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gtgccggaaa gctggctgga gtgcgatctt 
aactggcaga tgcacggtta cgatgcgccc 
gtcaatccgc cgtttgttcc cacggagaat 
gttgatgaaa gctggctaca ggaaggccag 
gcgtttcatc tgtggtgcaa cgggcgctgg 
tctgaatttg acctgagcgc atttttacgc 
ctgcgttgga gtgacggcag ttatctggaa 
ttccgtgacg tctcgttgct gcataaaccg 
actcgcttta atgatgattt cagccgcgct 
gagttgcgtg actacctacg ggtaacagtt 
agcggcaccg cgcctttcgg cggtgaaatt 
gtcacactac gtctgaacgt cgaaaacccg 
tatcgtgcgg tggttgaact gcacaccgcc 
gatgtcggtt tccgcgaggt gcggattgaa 
ttgctgattc gaggcgttaa ccgtcacgag 
gagcagacga tggtgcagga tatcctgctg 
tgttcgcatt atccgaacca tccgctgtgg 
gtggtggatg aagccaatat tgaaacccac 
gatccgcgct ggctaccggc gatgagcgaa 
aatcacccga gtgtgatcat ctggtcgctg 
gacgcgctgt atcgctggat caaatctgtc 
ggcggagccg acaccacggc caccgatatt 
gaccagccct tcccggctgt gccgaaatgg 
gagacgcgcc cgctgatcct ttgcgaatac 
ttcgctaaat actggcaggc gtttcgtcag 
gactgggtgg atcagtcgct gattaaatat 
ggcggtgatt ttggcgatac gccgaacgat 
gccgaccgca cgccgcatcc agcgctgacg 
ttccgtttat ccgggcaaac catcgaagtg 
aacgagctcc tgcactggat ggtggcgctg 
cctctggatg tcgctccaca aggtaaacag 
gagagcgccg ggcaactctg gctcacagta 
tcagaagccg ggcacatcag cgcctggcag 
acgctccccg ccgcgtccca cgccatcccg 
atcgagctgg gtaataagcg ttggcaattt 
tggattggcg ataaaaaaca actgctgacg 
ctggataacg acattggcgt aagtgaagcg 
cgctggaagg cggcgggcca ttaccaggcc 
acacttgctg atgcggtgct gattacgacc 
ttatttatca gccggaaaac ctaccggatt 
gatgttgaag tggcgagcga tacaccgcat 
gcgcaggtag cagagcgggt aaactggctc 
cgccttactg ccgcctgttt tgaccgctgg 
tacgtcttcc cgagcgaaaa cggtctgcgc 
caccagtggc gcggcgactt ccagttcaac 
gaaaccagcc atcgccatct gctgcacgcg 
ttccatatgg ggattggtgg cgacgactcc 
ctgagcgccg gtcgctacca ttaccagttg 
aagcttatcg ataccgtcga aacttgttta 
atagcatcac aaatttcaca aataaagcat 
ccaaactcat caatgtatct tatcatgtct 
ggtacgatga gacccgcacc aggtgcagac 
accagcctgt gatgctggat gtgaccgagg 
gcacccgcgc tgagtttggc tctagcgatg 
ggcgtggctt aagggtggga aagaatatat 
gttttgcagc agccgccgcc gccatgagca 
catatttgac aacgcgcatg cccccatggg 
gcattgatgg tcgccccgtc ctgcccgcaa 
ctggaacgcc gttggagact gcagcctccg 
gcgggattgt gactgacttt gctttcctga 
catccgcccg cgatgacaag ttgacggctc 



cctgaggccg atactgtcgt cgtcccctca 1380 
atctacacca acgtaaccta tcccattacg 1440 
ccgacgggtt gttactcgct cacatttaat 1500 
acgcgaatta tttttgatgg cgttaactcg 1560 
gtcggttacg gccaggacag tcgtttgccg 1620 
gccggagaaa accgcctcgc ggtgatggtg 1680 
gatcaggata tgtggcggat gagcggcatt 1740 
actacacaaa tcagcgattt ccatgttgcc 1800 
gtactggagg ctgaagttca gatgtgcggc 1860 
tctttatggc agggtgaaac gcaggtcgcc 1920 
atcgatgagc gtggtggtta tgccgatcgc 1980 
aaactgtgga gcgccgaaat cccgaatctc 2040 
gacggcacgc tgattgaagc agaagcctgc 2100 
aatggtctgc tgctgctgaa cggcaagccg 2160 
catcatcctc tgcatggtca ggtcatggat 2220 
atgaagcaga acaactttaa cgccgtgcgc 2280 
tacacgctgt gcgaccgcta cggcctgtat 2340 
ggcatggtgc caatgaatcg tctgaccgat 2400 
cgcgtaacgc gaatggtgca gcgcgatcgt 2460 
gggaatgaat caggccacgg cgctaatcac 2520 
gatccttccc gcccggtgca gtatgaaggc 2580 
atttgcccga tgtacgcgcg cgtggatgaa 2640 
tccatcaaaa aatggctttc gctacctgga 2700 
gcccacgcga tgggtaacag tcttggcggt 2760 
tatccccgtt tacagggcgg cttcgtctgg 2820 
gatgaaaacg gcaacccgtg gtcggcttac 2880 
cgccagttct gtatgaacgg tctggtcttt 2940 
gaagcaaaac accagcagca gtttttccag 3000 
accagcgaat acctgttccg tcatagcgat 3060 
gatggtaagc cgctggcaag cggtgaagtg 3120 
ttgattgaac tgcctgaact accgcagccg 3180 
cgcgtagtgc aaccgaacgc gaccgcatgg 3240 
cagtggcgtc tggcggaaaa cctcagtgtg 3300 
catctgacca ccagcgaaat ggatttttgc 3360 
aaccgccagt caggctttct ttcacagatg 3420 
ccgctgcgcg atcagttcac ccgtgcaccg 3480 
acccgcattg accctaacgc ctgggtcgaa 3540 
gaagcagcgt tgttgcagtg cacggcagat 3600 
gctcacgcgt ggcagcatca ggggaaaacc 3660 
gatggtagtg gtcaaatggc gattaccgtt 3720 
ccggcgcgga ttggccfcgaa ctgccagctg 3780 
ggattagggc cgcaagaaaa ctatcccgac 3840 
gatctgccat tgtcagacat gtataccccg 3900 
tgcgggacgc gcgaattgaa ttatggccca 3960 
atcagccgct acagtcaaca gcaactgatg 4020 
gaagaaggca catggctgaa tatcgacggt 4080 
tggagcccgt cagtatcggc ggaatttcag 4140 
gtctggtgtc aaaaataata atctcgaatc 4200 
ttgcagctta taatggttac aaataaagca 4260 
ttttttcact gcattctagt tgtggtttgt 4320 
ggatccgacc tcggatctgg aaggtgctga 4380 
cctgcgagtg tggcggtaaa catattagga 4440 
agctgaggcc cgatcacttg gtgctggcct 4500 
aagatacaga ttgaggtact gaaatgtgtg 4560 
aaggtggggg tcttatgtag ttttgtatct 4620 
ccaactcgtt tgatggaagc attgtgagct 4680 
ccggggtgcg tcagaatgtg atgggctcca 4740 
actctactac cttgacctac gagaccgtgt 4800 
ccgccgcttc agccgctgca gccaccgccc 4860 
gcccgcttgc aagcagtgca gcttcccgtt 4920 
ttttggcaca attggattct ttgacccggg 4980 
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aacttaatgt cgtttctcag cagctgttgg 
cttcctcccc tcccaatgcg gtttaaaaca 
ggatcaagca agtgtcttgc tgtctttatt 
accagcggtc tcggtcgttg agggtcctgt 
tctggatgtt cagatacatg ggcataagcc 
gagcttcatg ctgcggggtg gtgttgtaga 
ggtgcctaaa aatgtctttc agtagcaagc 
tgtttacaaa gcggttaagc tgggatgggt 
actgtatttt taggttggct atgttcccag 
gaaccaccag cacagtgtat ccggtgcact 
atgcgtggaa gaacttggag acgcccttgt 
taatgatggc aatgggccca cgggcggcgg 
cgtcatagtt gtgttccagg atgagatcgt 
gggtgccaga ctgcggtata atggttccat 
tttgcatttc ccacgctttg agttcagatg 
agaaaacggt ttccggggta ggggagatca 
gcgacttacc gcagccggtg ggcccgtaaa 
taagagagct gcagctgccg tcatccctga 
tgactcgcat gttttccctg accaaatccg 
gttcttgcaa ggaagcaaag tttttcaacg 
tgagcgtttg accaagcagt tccaggcggt 
ctcgatccag catatctcct cgtttcgcgg 
tcggtgctcg tccagacggg ccagggtcat 
cgtagtctgg gtcacggtga aggggtgcgc 
gaggctggtc ctgctggtgc tgaagcgctg 
gcatttgacc atggtgtcat agtccagccc 
gcccttggag gaggcgccgc acgaggggca 
cgcgagaaat accgattccg gggagtaggc 
gcattccacg agccaggtga gctctggccg 
ctttttgatg cgtttcttac ctctggtttc 
aaggctgtcc gtgtccccgt atacagactt 
gtcctcctcg tatagaaact cggaccactc 
gaaggaggct aagtgggagg ggtagcggtc 
ggtgtgaaga cacatgtcgc cctcttcggc 
ggccacgtga ccgggtgttc ctgaaggggg 
ctcactctct tccgcatcgc tgtctgcgag 
aaaagcgggc atgacttctg cgctaagatt 
attcacctgg cccgcggtga tgcctttgag 
aatctttttg ttgtcaagct tggtggcaaa 
ggcgatggag cgcagggttt ggtttttgtc 
tagctgcacg tattcgcgcg caacgcaccg 
gggcaccagg tgcacgcgcc aaccgcggtt 
tacctctccg cgtaggcgct cgttggtcca 
tggcggtagg gggtctagct gcgtctcgtc 
gggcagcagg cgcgcgtcga agtagtctat 
ccatgcgcgg gcggcaagcg cgcgctcgta 
gtgggtgagc gcggaggcgt acatgccgca 
tattccaaga tatgtagggt agcatcttcc 
tagttcgtgc gagggagcga ggaggtcggg 
tcggaagact atctgcctga agatggcatg 
gacgttgaag ctggcgtctg tgagacctac 
gcgcagcttg ttgaccagct cggcggtgac 
ttccttgatg atgtcatact tatcctgtcc 
aaactcttcg cggtctttcc agtactcttg 
agagcctagc atgtagaact ggttgacggc 
tagcgcgtat gcctgcgcgg ccttccggag 
gaccatgact ttgaggtact ggtatttgaa 
gagcaaaaag tccgtgcgct ttttggaacg 
gaagagtatc tttcccgcgc gaggcataaa 
ctcggaacgg ttgttaatta cctgggcggc 
gtggcccaca atgtaaagtt ccaagaagcg 



atctgcgcca gcaggtttct gccctgaagg 5040 
taaataaaaa accagactct gtttggattt 5100 
taggggtttt gcgcgcgcgg taggcccggg 5160 
gtattttttc caggacgtgg taaaggtgac 5220 
cgtctctggg gtggaggtag caccactgca 52 80 
tgatccagtc gtagcaggag cgctgggcgt 5340 
tgattgccag gggcaggccc ttggtgtaag 5400 
gcatacgtgg ggatatgaga tgcatcttgg 5460 
ccatatccct ccggggattc atgttgtgca 5520 
tgggaaattt gtcatgtagc ttagaaggaa 5580 
gacctccaag attttccatg cattcgtcca 5640 
cctgggcgaa gatatttctg ggatcactaa 57 00 
cataggccat ttttacaaag cgcgggcgga 57 60 
ccggcccagg ggcgtagtta ccctcacaga 5820 
gggggatcat gtctacctgc ggggcgatga 58 80 
gctgggaaga aagcaggttc ctgagcagct 5940 
tcacacctat taccggctgc aactggtagt 6000 
gcaggggggc cacttcgtta agcatgtccc 6060 
ccagaaggcg ctcgccgccc agcgatagca 6120 
gtttgagacc gtccgccgta ggcatgcttt 6180 
cccacagctc ggtcacctgc tctacggcat 6240 
gttggggcgg ctttcgctgt acggcagtag 63 00 
gtctttccac gggcgcaggg tcctcgtcag 6360 
tccgggctgc gcgctggcca gggtgcgctt 6420 
ccggtcttcg ccctgcgcgt cggccaggta 6480 
ctccgcggcg tggcccttgg cgcgcagctt 6540 
gtgcagactt ttgagggcgt agagcttggg 6600 
atccgcgccg caggccccgc agacggtctc 6660 
ttcggggtca aaaaccaggt ttcccccatg 6720 
catgagccgg tgtccacgct cggtgacgaa 6780 
gagaggcctg tcctcgagcg gtgttccgcg 6840 
tgagacaaag gctcgcgtcc aggccagcac 6900 
gttgtccact agggggtcca ctcgctccag 6960 
atcaaggaag gtgattggtt tgtaggtgta 7020 
gctataaaag ggggtggggg cgcgttcgtc 7080 
ggccagctgt tggggtgagt actccctctg 7140 
gtcagtttcc aaaaacgagg aggatttgat 7200 
ggtggccgca tccatctggt cagaaaagac 7260 
cgacccgtag agggcgttgg acagcaactt 7320 
gcgatcggcg cgctccttgg ccgcgatgtt 7380 
ccattcggga aagacggtgg tgcgctcgtc 7440 
gtgcagggtg acaaggtcaa cgctggtggc 7500 
gcagaggcgg ccgcccttgc gcgagcagaa 7560 
cggggggtct gcgtccacgg taaagacccc 7620 
cttgcatcct tgcaagtcta gcgcctgctg 7680 
tgggttgagt gggggacccc atggcatggg 7740 
aatgtcgtaa acgtagaggg gctctctgag 7800 
accgcggatg ctggcgcgca cgtaatcgta 7860 
accgaggttg ctacgggcgg gctgctctgc 7920 
tgagttggat gatatggttg gacgctggaa 7980 
cgcgtcacgc acgaaggagg cgtaggagtc 804 0 
ctgcacgtct agggcgcagt agtccagggt 8100 
cttttttttc cacagctcgc ggttgaggac 8160 
gatcggaaac ccgtcggcct ccgaacggta 8220 
ctggtaggcg cagcatccct tttctacggg 8280 
cgaggtgtgg gtgagcgcaa aggtgtccct 8340 
gtcagtgtcg tcgcatccgc cctgctccca 8400 
cggatttggc agggcgaagg tgacatcgtt 8460 
gttgcgtgtg atgcggaagg gtcccggcac 8520 
gagcacgatc tcgtcaaagc cgttgatgtt 8580 
cgggatgccc ttgatggaag gcaatttttt 8640 
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aagttcctcg taggtgagct cttcagggga gctgagcccg tgctctgaaa gggcccagtc 8700 
tgcaagatga gggttggaag cgacgaatga gctccacagg tcacgggcca ttagcatttg 8760 
caggtggtcg cgaaaggtcc taaactggcg acctatggcc attttttctg gggtgatgca 8820 
gtagaaggta agcgggtctt gttcccagcg gtcccatcca aggttcgcgg ctaggtctcg 8880 
cgcggcagtc actagaggct catctccgcc gaacttcatg accagcatga agggcacgag 8940 
ctgcttccca aaggccccca tccaagtata ggtctctaca tcgtaggtga caaagagacg 9000 
ctcggtgcga ggatgcgagc cgatcgggaa gaacfcggatc tcccgccacc aattggagga 9060 
gtggctattg atgtggtgaa agtagaagtc cctgcgacgg gccgaacact cgtgctggct 9120 
tttgtaaaaa cgtgcgcagt actggcagcg gtgcacgggc tgtacatcct gcacgaggtt 9180. 
gacctgacga ccgcgcacaa ggaagcagag tgggaatttg agcccctcgc ctggcgggtt 9240 
tggctggtgg tcttctactt cggctgcttg tccttgaccg tctggctgct cgaggggagt 9300 
tacggtggat cggaccacca cgccgcgcga gcccaaagtc cagatgtccg cgcgcggcgg 9360 
tcggagcttg atgacaacat cgcgcagatg ggagctgtcc atggtctgga gctcccgcgg 9420 
cgtcaggtca ggcgggagct cctgcaggtt tacctcgcat agacgggtca gggcgcgggc 9480 
tagatccagg'tgatacctaa tttccagggg ctggttggtg gcggcgtcga tggcttgcaa 954 0 
gaggccgcat ccccgcggcg cgactacggt accgcgcggc gggcggtggg ccgcgggggt 9600 
gtccttggat gatgcatcta aaagcggtga cgcgggcgag cccccggagg tagggggggc 9660 
tccggacccg ccgggagagg gggcaggggc acgtcggcgc cgcgcgcggg caggagctgg 9720 
tgctgcgcgc gtaggttgct ggcgaacgcg acgacgcggc ggttgatctc ctgaatctgg 9780 
cgcctctgcg tgaagacgac gggcccggtg agcttgagcc tgaaagagag ttcgacagaa 9840 
tcaatttcgg tgtcgttgac ggcggcctgg cgcaaaatct cctgcacgtc tcctgagttg 9900 
tcttgatagg cgatctcggc catgaactgc tcgatctctt cctcctggag atctccgcgt 9960 
ccggctcgct ccacggtggc ggcgaggtcg ttggaaatgc gggccatgag ctgcgagaag 10020 
gcgttgaggc ctccctcgtt ccagacgcgg ctgtagacca cgcccccttc ggcatcgcgg 10080 
gcgcgcatga ccacctgcgc gagattgagc tccacgtgcc gggcgaagac ggcgtagttt 10140 
cgcaggcgct gaaagaggta gttgagggtg gtggcggtgt gttctgccac gaagaagtac 10200 
ataacccagc gtcgcaacgt ggattcgttg atatccccca aggcctcaag gcgctccatg 10260 
gcctcgtaga agtccacggc gaagttgaaa aactgggagt tgcgcgccga cacggttaac 10320 
tcctcctcca gaagacggat gagctcggcg acagtgtcgc gcacctcgcg ctcaaaggct 10380 
acaggggcct cttcttcttc ttcaatctcc tcttccataa gggcctcccc ttcttcttct 10440 
tctggcggcg gtgggggagg ggggacacgg cggcgacgac ggcgcaccgg gaggcggtcg 10500 
acaaagcgct cgatcatctc cccgcggcga cggcgcatgg tctcggtgac ggcgcggccg 10560 
ttctcgcggg ggcgcagttg gaagacgccg cccgtcatgt cccggttatg ggttggcggg 10620 
gggctgccat gcggcaggga tacggcgcta acgatgcatc tcaacaattg ttgtgtaggt 10680 
actccgccgc cgagggacct gagcgagtcc gcatcgaccg gatcggaaaa cctctcgaga 10740 
aaggcgtcta accagtcaca gtcgcaaggt aggctgagca ccgtggcggg cggcagcggg 10800 
cggcggtcgg ggttgtttct ggcggaggtg ctgctgatga tgtaattaaa gtaggcggtc 10860 
ttgagacggc ggatggtcga cagaagcacc atgtccttgg gtccggcctg ctgaatgcgc 10920 
aggcggtcgg ccatgcccca ggcttcgttt tgacatcggc gcaggtcttt gtagtagtct 10980 
tgcatgagcc tttctaccgg cacttcttct tctccttcct cttgtcctgc atctcttgca 11040 
tctatcgctg cggcggcggc ggagtttggc cgtaggtggc . gccctcttcc tcccatgcgt 11100 
gtgaccccga agcccctcat cggctgaagc agggctaggt cggcgacaac gcgctcggct 11160 
aatatggcct gctgcacctg cgtgagggta gactggaagt catccatgtc cacaaagcgg 11220 
tggtatgcgc ccgtgttgat ggtgtaagtg cagttggcca taacggacca gttaacggtc 11280 
tggtgacccg gctgcgagag ctcggtgtac ctgagacgcg agtaagccct cgagtcaaat 11340 
acgtagtcgt tgcaagtccg caccaggtac tggtatccca ccaaaaagtg cggcggcggc 11400 
tggcggtaga ggggccagcg tagggtggcc ggggctccgg gggcgagatc ttccaacata 11460 
aggcgatgat atccgtagat gtacctggac atccaggtga tgccggcggc ggtggtggag 11520 
gcgcgcggaa agtcgcggac gcggttccag atgttgcgca gcggcaaaaa gtgctccatg 11580 
gtcgggacgc tctggccggt caggcgcgcg caatcgttga cgctctagac cgtgcaaaag 11640 
gagagcctgt aagcgggcac tcttccgtgg tctggtggat aaattcgcaa gggtatcatg 11700 
gcggacgacc ggggttcgag ccccgtatcc ggccgtccgc cgtgatccat gcggttaccg 11760 
cccgcgtgtc gaacccaggt gtgcgacgtc agacaacggg ggagtgctcc ttttggcttc 11820 
cttccaggcg cggcggctgc tgcgctagct tttttggcca ctggccgcgc gcagcgtaag 11880 
cggttaggct ggaaagcgaa agcattaagt ggctcgctcc ctgtagccgg agggttattt 11940 
tccaagggtt gagtcgcggg acccccggtt cgagtctcgg accggccgga ctgcggcgaa 12000 
cgggggtttg cctccccgtc atgcaagacc ccgcttgcaa attcctccgg aaacagggac 12060 
gagccccttt tttgcttttc ccagatgcat ccggtgctgc ggcagatgcg cccccctcct 12120 
cagcagcggc aagagcaaga gcagcggcag acatgcaggg caccctcccc tcctcctacc 12180 
gcgtcaggag gggcgacatc cgcggttgac gcggcagcag atggtgatta cgaacccccg 12240 
cggcgccggg cccggcacta cctggacttg gaggagggcg agggcctggc gcggctagga 12300 
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g c 9 c=ct=tc ..J— cco» s ^sss s&ss ssssss Si£ 

SSS3SS ££££ 1^1411 SSSS |i ggotgj. gjj. 

glggaggact Wgagcccga <=9=9cg~cc Mjms» "pallia ctttlallal 12600 

5SSSS SSSSSS5 SSSK 3S5S ^« ~-| ««• 

ssss sssss sssss ii Ss ~* sr. 

SS5SSS SSSSS gl | «™ SSS 

Itl.goctgg 9=»9tt". ^ccogoajg ^g g ?glt ta=ct:tglg= 13020 

iSaiis sssss «™i ffi-sA ^i-gt gss- g»s 

3S5E SSSSS SSSSS S g fsSS SSSS SSS 

agccgacgcg ccctggaggc agctggggcc sgacctgggc tggcggtgc ^33^ Q 

S35SS 3SSS S5SSS SS S| I f~~E» £»• 

SSSS SSSSS iSgll g i i| |l 233 SKS 
2KSES SSSSE 3SSS aWAj 3S = S3 ««• 

sbss sssss insist g s i sjs 

5SSSSSS: S3KSS SS5SS SEES ^| gag ufcl^c 130,0 

tlatggtg.c tgagac.ccg c~>gtjj»99 gSIttSaa aacttgc.gg 13980 

agaccagtag acaaggectg cagacegtaa acctgagcca 93=' ttactaaege 14040 

cggcaglct ggaggcaacc ctaaactacc t9«tgacc»a ccggcggcag aagatcccct 1428 
cgtEgcacag tttaaacagc gaggaggagc geattttgeg ^tacgtgcag ^99 9 
glctlaacct gatgegegae ggggtaaege ccagcgtggc a tg|altact 14460 

Icatggaacc gggcatgtat gcctcaaacc g^egtttat ^aaccgccta «9| 14520 
tgcatcgcgc ggccgccgtg aaccccgagt atttcaccaa *9«*2gg aacgatggat 14580 
ggctaccgcc ccctggtttc tacacegggg |^tcgaggt g^cgagggt a 
tcctctggga cgacatagac gacagcgtgt "tccccgca *u a = aggecaagea 14700 
tgcaacagcg cgagcaggca gaggeggege tgcgaaagga aagcttccgc aggc ? ^ q 
gcttgtccga tetaggeget gcggccccgc ggtcagatgc tagtagc | 14820 

tgatagggtc tcttaccagc actcgcacca cccgcccgcg cctgctgggc gagg ga a 

™S S3SS3 SSKS I f ~ iSSSS 

SSSKSS SS3S? SI 1 i™ fesS SEE SSS°. 
S3SSS SEES SSSS K33S yW S j S 

Iccolttlgt atgcggcgcg cggcgatgta t g a gg .«ggt c «=°"°" cSSSSggl 15300 

23SS5S S3ESS S5535 f £££2 EX 

S2SSSS sssss JSSSS =11 I I ffj- sss 

Ilaclatgac tacagcccgg gggaggcaag cacacagacc tglacglitt 15600 

gcactggggc ggcgacctga aaaccatcct ^ataccaac atgccaaatg j, gj^ 1566Q 
catgtttacc aataagttta aggegegggt gatggtgccg cgc a actactccga 15720 

tcalgtggag ctgaaatacg agtgggtgga gjcacgctg cccgaggg^ | g 15780 

gaccatgacc atagacctta tgaacaaege S«cs^33 » gacacccgca acttcagact 15840 
acagaaeggg g^tctggaaa gegacategg 3|^J«^J ? a tacaalcg aagccttcca 15900 
?^aga=Sc at^gctg^ SggaScgg SSSStc acccacagcc gectgagcaa 15960 
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cttgttgggc 
tctggagggt 
agatgacacc 
ggaagagaac 
tgccattcgc 
agcggccgaa 
ggtgatcaaa 
cagcaccttc 
cggaatccgc 
ctactggtcg 
cagcaacttt 
cgaccaggcc 
tcgctttccc 
tgaaaacgtt 
agtccagcga 
cctgggcata 
ccttatatcg 
cggggccaag 
ctggggcgcg 
ggtggtggag 

ggccattcag 
gcgcgtagca 
cctgcttaac 
ggccgcgggt 
cgcggccatt 
ggttagcggc 
aaactactta 
gtc.caagcgc 
cccgaagaag 
gaaagatgat 
gcgacgggta 
agtctttacg 
cggcgacgag 
gcggcataag 
gcccgtaaca 
aaagcgcgag 
actggaagat 
gcggccaatc 
cactaccagt 
ggttgcctca 
ctctacggag 
cggttcgagg 
cattgcgcct 
ccgacgccga 
cccgatttcc 
gcgctaccac 
cacctgccgc 
catggccggc 
gtcgcaccgt 
gattggcgcc 
aacaagttgc 
tgtaactatt 
cgcgcccgtt 
tcagctgggg 
gcagcaaggc 
atttccaaca 
ccaaccaggc 
agcctccacc 
ccgacaggga 
taaagcaagg 
agcacacacc 



atccgcaagc 
ggtaacattc 
gaacagggcg 
tccaacgcgg 
ggcgacacct 
gctgccgccc 
cccctgacag 
acccagtacc 
tcatggaccc 
ttgccagaca 
ccggtggtgg 
gtctactccc 
gagaaccaga 
cctgctctca 
gtgaccatta 
gtctcgccgc 
cccagcaata 
aagcgctccg 
cacaaacgcg 
gaggcgcgca 
accgtggtgc 
cgtcgccacc 
cgcgcacgtc 
attgtcactg 
agtgctatga 
ctgcgcgtgc 
gactcgtact 
aaaatcaaag 
gaagagcagg 
gatgatgaac 
cagtggaaag 
cccggtgagc 
gacctgcttg 
gacatgctgg 
ctgcagcagg 
tctggtgact 
gtcttggaaa 
aagcaggtgg 
agcaccagta 
gcggtggcgg 
gtgcaaacgg 
aagtacggcg 
acccccggct 
accaccactg 
gtgcgcaggg 
cccagcatcg 
ctccgtttcc 
cacggcctga 
cgcatgcgcg 
gtgcccggaa 
atgtggaaaa 
ttgtagaatg 
catgggaaac 
ctcgctgtgg 
ctggaacagc 
aaaggtggta 
agtgcaaaat 
ggccgtggag 
agaaactctg 
cctgcccacc 
cgtaacgctg 



ggcaaccctt 

ccgcactgtt 

ggggtggcgc 

cagccgcggc 

ttgccacacg 

ccgctgcgca 

aggacagcaa 

gcagctggta 

tgctttgcac 

tgatgcaaga 

gcgccgagct 

aactcatccg 

ttttggcgcg 

cagatcacgg 

ctgacgccag 

gcgtccfcatc 

acacaggctg 

accaacaccc 

gccgcactgg 

actacacgcc 

gcggagcccg 

gccgccgacc 

gcaccggccg 

tgccccccag 

ctcagggtcg 

ccgtgcgcac 

gttgtatgta 

aagagatgct 

attacaagcc 

ttgacgacga 

gtcgacgcgt 

gctccacccg 

agcaggccaa 

cgttgccgct 

tgctgcccgc 

tggcacccac 

aaatgaccgt 

cgccgggact 

ttgccaccgc 

atgccgcggt 

acccgtggat 

ccgccagcgc 

atcgtggcta 

gaacccgccg 

tggctcgcga 

tttaaaagcc 

cggtgccggg 

cgggcggcat 

gcggtatcct 

ttgcatccgt 

atcaaaataa 

gaagacatca 

tggcaagata 

agcggcatta 

agcacaggcc 

gatggcctgg 

aagattaaca 

acagtgtctc 

gtgacgcaaa 

acccgtccca 

gacctgcctc 



ccaggagggc 

ggatgtggac 

aggcggcagc 

aatgcagccg 

ggctgaggag 

acccgaggtc 

gaaacgcagt 

ccttgcatac 

tcctgacgta 

ccccgtgacc 

gttgcccgtg 

ccagtttacc 

cccgccagcc 

gacgctaccg 

acgccgcacc 

gagccgcact 

gggcctgcgc 

agtgcgcgtg 

gcgcaccacc 

cacgccgcca 

gcgctatgct 

cggcactgcc 

acgggcggcc 

gtccaggcga 

caggggcaac 

ccgccccccg 

tccagcggcg 

ccaggtcatc 

ccgaaagcta 

ggtggaactg 

aaaacgtgtt 

cacctacaag 

cgagcgcctc 

ggacgagggc 

gcttgcaccg 

cgtgcagctg 

ggaacctggg 

gggcgtgcag 

cacagagggc 

gcaggcggtc 

gtttcgcgtt 

gctactgccc 

cacctaccgc 

ccgccgtcgc 

aggaggcagg 

ggtctttgtg 

attccgagga 

gcgtcgtgcg 

gcccctcctt 

ggccttgcag 

aaagtctgga 

actttgcgtc 

tcggcaccag 

aaaatttcgg 

agatgctgag 

cctctggcat 

gtaagcttga 

cagaggggcg 

tagacgagcc 

tcgcgcccat 

cccccgccga 



tttaggatca 

gcctaccagg 

aacagcagtg 

gtggaggaca 

aagcgcgctg 

gagaagcctc 

tacaacctaa 

aactacggcg 

acctgcggct 

ttccgctcca 

cactccaaga 

tctctgaccc 

cccaccatca 

ctgcgcaaca 

tgcccctacg 

ttttgagcaa 

ttcccaagca 

cgcgggcact 

gtcgatgacg 

ccagtgtcca 

aaaatgaaga 

gcccaacgcg 

atgcgggccg 

cgagcggccg 

gtgtattggg 

cgcaactaga 

gcggcgcgca 

gcgccggaga 

aagcgggtca 

ctgcacgcta 

ttgcgacccg 

cgcgtgtatg 

ggggagtttg 

aacccaacac 

tccgaagaaa 

atggtaccca 

ctggagcccg 

accgtggacg 

atggagacac 

gctgcggccg 

tcagcccccc 

gaatatgccc 

cccagaagac 

cgtcgccagc 

accctggtgc 

gttcttgcag 

agaatgcacc 

caccaccggc 

attccactga 

gcgcagagac 

ctctcacgct 

tctggccccg 

caatatgagc 

ttccaccgtt 

ggataagttg 

tagcggggtg 

tccccgccct 

tggcgaaaag 

tccctcgtac 

ggctaccgga 

cacccagcag 



cctacgatga 
cgagcttgaa 
gcagcggcgc 
tgaacgatca 
aggccgaagc 
agaagaaacc 
taagcaatga 
accctcagac 
cggagcaggt 
cgcgccagat 
gcttctacaa 
acgtgttcaa 
ccaccgtcag 
gcatcggagg 
tttacaaggc 
gcatgtccat 
agatgtttgg 
accgcgcgcc 
ccatcgacgc 
cagtggacgc 
gacggcggag 
cggcggcggc 
ctcgaaggct 
ccgcagcagc 
tgcgcgactc 
ttgcaagaaa 
acgaagctat 
tctatggccc 
aaaagaaaaa 
ccgcgcccag 
gcaccaccgt 
atgaggtgta 
cctacggaaa 
ctagcctaaa 
agcgcggcct 
agcgccagcg 
aggtccgcgt 
ttcagafcacc 
aaacgtcccc 
cgtccaagac 
ggcgcccgcg 
tacatccttc 
gagcaactac 
ccgtgctggc 
tgccaacagc 
atatggccct 
gtaggagggg 

ggcggcgcgc 

tcgccgcggc 
actgattaaa 
cgcttggtcc 
cgacacggct 
ggtggcgcct 
aagaactatg 
aaagagcaaa 
gtggacctgg 
cccgtagagg 
cgtccgcgcc 
gaggaggcac 
gtgctgggcc 
aaacctgtgc 



16020 

16080 

16140 

16200 

16260 

16320 

16380 

16440 

16500 

16560 

16620 

16680 

16740 

16800 

16860 

16920 

16980 

17040 

17100 

17160 

17220 

17280 

17340 

17400 

17460 

17520 

17580 

17640 

17700 

17760 

17820 

17880 

17940 

18000 

18060 

18120 

18180 

18240 

18300 

18360 

18420 

18480 

18540 

18600 

18660 

18720 

18780 

18840 

18900 

18960 

19020 

19080 

19140 

19200 

19260 

19320 

19380 

19440 

19500 

19560 

19620 
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tgccaggccc 
ccagcggtcc 
gcatcgtggg 
tgtcgtatgt 
gcgcccgctt 
ctcgggccag 
cgagacgtac 
cgtgaccaca 
tactgcgtac 
catggcttcc 
ctactctggc 
ggatgaagct 
cgaagtagac 
tggtataaat 
tgccgataaa 
aattaatcat 
ttcatatgca 
tggaaagcta 
caatggtgat 
cccagacact 
gggccaacaa 
tctaatgtat 
gaatgctgtt 
ttccattggt 
tccagatgtt 
tccactggga 
ggaaaatgga 
aaataatttt 
caacatagcg 
tgataaccca 
ctgctacatt 
taaccaccac 
tgtgcccttc 
gccgggctca 
ctccctagga 
ttacgccacc 
aaacgacacc 
tatacccgcc 
ccgcggctgg 
cgacccttat 
ccacaccttt 
ccgcctgctt 
cgttgcccag 
cattggctac 
tagaaacttc 
acaggtgggc 
catgcgcgaa 
agttgacagc 
ctccagtaac 
caactccgcc 
tctttatgtt 
catcgaaacc 
caagcaacat 
gtcaaagatc 
tttgtttctc 
ggcgtacact 
ccctttggct 
ctgcgccgta 
caaagcgtac 
gcctttgcca 
ggggtaccca 



gaccgccgtt 
gcgatcgttg 
tcfcgggggtg 
gtgtcatgta 
tccaagatgg 
gacgcctcgg 
ttcagcctga 
gaccggtccc 
tcgtacaagg 
acgtactttg 
actgcctaca 
gctactgctc 
gagcaagctg 
atfcacaaagg 
acatttcaac 
gcagctggga 
aaacccacaa 
gaaagtcaag 
aacttgactc 
catatttctt 
tctatgccca 
tacaacagca 
gtagatttgc 
gatagaacca 
agaattattg 
ggtgtgatta 
tgggaaaaag 
gccatggaaa 
ctgtatttgc 
aacacctacg 
aaccttggag 
cgcaatgctg 
cacatccagg 
tacacctacg 
aatgacctaa 
ttcttcccca 
aacgaccagt 
aacgctacca 
gccttcacgc 
tacacctact 
aagaaggtgg 
acccccaacg 
tgtaacatga 
cagggcttct 
cagcccatga 
atcctacacc 
ggacaggcct 
attacccaga 
tttatgtcca 
cacgcgctag 
ttgtttgaag 
gtgtacctgc 
caacaacagc 
ttggttgtgg 
cacacaagct 
ggatggcctt 
tttctgacca 
gcgccattgc 
aggggcccaa 
actggcccca 
actccatgct 



gttgtaaccc 
cggcccgtag 
caatccctga 
tgcgtccatg 
ctaccccttc 
agtacctgag 
ataacaagtt 
agcgtttgac 
cgcggttcac 
acatccgcgg 
acgccctggc 
ttgaaataaa 
agcagcaaaa 
agggtattca 
ctgaacctca 
gagtccttaa 
atgaaaatgg 
tggaaatgca 
ctaaagtggt 
acatgcccac 
acaggcctaa 
cgggtaatat 
aagacagaaa 
ggtacttttc 
aaaatcatgg 
atacagagac 
atgctacaga 
tcaatctaaa 
ccgacaagct 
actacatgaa 
cacgctggtc 
gcctgcgcta 
tgcctcagaa 
agtggaactt 
gggttgacgg 
tggcccacaa 
cctttaacga 
acgtgcccat 
gccttaagac 
ctggctctat 
ccattacctt 
agtttgaaat 
ccaaagactg 
atatcccaga 
gccgtcaggt 
aacacaacaa 
accctgctaa 
aaaagtttct 
tgggcgcact 
acatgacttt 
tctttgacgt 
gcacgccctt 
tgccgccatg 
gccatatttt 
cgcctgcgcc 
tgcctggaac 
gcgactcaag 
ttcttccccc 
ctcggccgcc 
aactcccatg 
caacagtccc 



gtcctagccg 
ccagtggcaa 
agcgccgacg 
tcgccgccag 
gatgatgccg 
ccccgggctg 
tagaaacccc 
gctgcggttc 
cctagctgtg 
cgtgctggac 
tcccaagggt 
cctagaagaa 
aactcacgta 
aataggtgtc 
aataggagaa 
aaagactacc 
agggcaaggc 
atttttctca 
attgtacagt 
tattaaggaa 
ttacattgct 

gggtgttctg 

cacagagctt 
tatgtggaat 
aactgaagat 
tcttaccaag 
attttcagat 
tgccaacctg 
aaagtacagt 
caagcgagtg 
ccttgactat 
ccgctcaatg 
gttctttgcc 
caggaaggat 
agccagcatt 
caccgcctcc 
ctatctctcc 
atccatcccc 
taaggaaacc 
accctaccta 
tgactcttct 
taagcgctca 
gttcctggta 
gagctacaag 
ggtggatgat 
ctctggattt 
cttcccctat 
ttgcgatcgc 
cacagacctg 
tgaggtggat 
ggtccgtgtg 
ctcggccggc 
ggctccagtg 
ttgggcacct 
atagtcaata 
ccgcactcaa 
caggtttacc 
gaccgctgta 
tgtggactat 
gatcacaacc 
caggtacagc 



cgcgtccctg 
ctggcaaagc 
atgcttctga 
aggagctgct 
cagtggtctt 
gtgcagtttg 
acggtggcgc 
atccctgtgg 
ggtgataacc 
aggggcccta 
gccccaaatc 
gaggacgatg 
tttgggcagg 
gaaggtcaaa 
tctcagtggt 
ccaatgaaac 
attcttgtaa 
actactgagg 
gaagatgtag 
ggtaactcac 
tttagggaca 
gcgggccaag 
tcataccagc 
caggctgttg 
gaacttccaa 
gtaaaaccta 
aaaaatgaaa 
tggagaaatt 
ccttccaacg 
gtggctcccg 
atggacaacg 
ttgctgggca 
attaaaaacc 
gttaacatgg 
aagtttgata 
acgcttgagg 
gccgccaaca 
tcccgcaact 
ccatcactgg 
gatggaacct 
gtcagctggc 
gttgacgggg 
caaatgctag 
gaccgcatgt 
actaaataca 
gttggctacc 
ccgcttatag 
accctttggc 
ggccaaaacc 
cccatggacg 
caccggccgc 
aacgccacaa 
agcaggaact 
atgacaagcg 
cggccggtcg 
aaacatgcta 
agtttgagta 
taacgctgga 
tctgctgcat 
ccaccatgaa 
ccaccctgcg 



cgccgcgccg 
acactgaaca 
atagctaacg 
gagccgccgc 
acatgcacat 
cccgcgccac 
ctacgcacga 
accgtgagga 
gtgtgctgga 
cttttaagcc 
cttgcgaatg 
acaacgaaga 
cgccttattc 
cacctaaata 
acgaaactga 
catgttacgg 
agcaacaaaa 
cgaccgcagg 
atatagaaac 
gagaactaat 
attttattgg 
catcgcagtt 
ttttgcttga 
acagctatga 
attactgctt 
aaacaggtca 
taagagttgg 
tcctgtactc 
taaaaatttc 
ggttagtgga 
tcaacccatt 
atggtcgcta 
tccttctcct 
ttctgcagag 
gcatttgcct 
ccatgcttag 
tgctctaccc 
gggcggcttt 
gctcgggcta 
tttacctcaa 
ctggcaatga 
agggttacaa 
ctaactacaa 
actccttctt 
aggactacca 
ttgcccccac 
gcaagaccgc 
gcatcccatt 
ttctctacgc 
agcccaccct 
accgcggcgt 
cataaagaag 
gaaagccatt 
ctttccaggc 
cgagactggg 
cctctttgag 
cgagtcactc 
aaagtccacc 
gtttctccac 
ccttattacc 
tcgcaaccag 



19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
20280 
20340 
20400 
20460 
20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
21180 
21240 
21300 
21360 
21420 
21480 
21540 
21600 
21660 
21720 
21780 
21840 
21900 
21960 
22020 
22080 
22140 
22200 
22260 
22320 
22380 
22440 
22500 
22560 
22620 
22680 
22740 
22800 
22860 
22920 
22980 
23040 
23100 
23160 
23220 
23280 
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gaacagctct acagcttcct ggagcgccac 
attaggagcg ccacttcttt ttgtcacttg 
actttcaata aaggcaaatg cttttatttg 
cttgccgtct gcgccgtttg gggaggcggc 
atggttgggg gacgtcgcgc cgcaccgcgt 
tcttcccgac tggccatttc cttctcctat 
aagaaggaca gcctaaccgc cccctctgag 
aacgcgccta ccaccttccc cgtcgaggca 
gagcaggacc caggttttgt aagcgaagac 
aaaaagcaag accaggacaa cgcagaggca 
aggcatggcg actacctaga tgtgggagac 
tgcgccatta tctgcgacgc gttgcaagag 
gtcagccttg cctacgaacg ccacctattc 
aacggcacat gcgagcccaa cccgcgcctc 
gtgcttgcca cctatcacat ctttttccaa 
aaccgcagcc gagcggacaa gcagctggcc 
gcctcgctca acgaagtgcc aaaaatcttt 
gcaaacgctc tgcaacagga aaacagcgaa 
ctcgagggtg acaacgcgcg cctagccgta 
gcctacccgg cacttaacct accccccaag 
gtgcgccgtg cgcagcccct ggagagggat 
ctacccgcag ttggcgacga gcagctagcg 
ttggaggagc gacgcaaact aatgatggcc 
atgcagcggt tctttgctga cccggagatg 
acctttcgac agggctacgt acgccaggcc 
ctggtctcct accttggaat tttgcacgaa 
acgctcaagg gcgaggcgcg ccgcgactac 
tacacctggc agacggccat gggcgtttgg 
gagctgcaga aactgctaaa gcaaaacttg 
tccgtggccg cgcacctggc ggacatcatt 
cagggtctgc cagacttcac cagtcaaagc 
gagcgctcag gaatcttgcc cgccacctgc 
aagtaccgcg aatgccctcc gccgctttgg 
taccttgcct accactctga cataatggaa 
cactgtcgct gcaacctatg caccccgcac 
aacgaaagtc aaattatcgg tacctttgag 
gcggctccgg ggttgaaact cactccgggg 
gtacctgagg actaccacgc ccacgagatt 
aatgcggagc ttaccgcctg cgtcattacc 
atcaacaaag cccgccaaga gtttctgcta 
cagtccggcg aggagctcaa cccaatcccc 
cgggcccttg cttcccagga tggcacccaa 
ggacgaggag gaatactggg acagtcaggc 
catgatggaa gactgggaga gcctagacga 
cgaaacaccg tcaccctcgg tcgcattccc 
ttccagcatg gctacaacct ccgctcctca 
caaccgtaga tgggacacca ctggaaccag 
agcccaagag caacaacagc gccaaggcta 
agttgcttgc ttgcaagact gtgggggcaa 
ccatcacggc gtggccttcc cccgfcaacat 
atactgcacc ggcggcagcg gcagcggcag 
gaccggatag caagactctg acaaagccca 
gaggagcgct gcgtctggcg cccaacgaac 
tttttcccac tctgtatgct atatttcaac 
taaaaaacag gtctctgcga tccctcaccc 
agcttcggcg cacgctggaa gacgcggagg 
ttaaggacta gtttcgcgcc ctttctcaaa 
ggccacaccc ggcgccagca cctgtcgtca 
cctacatgtg gagttaccag ccacaaatgg 
caacccgaat aaactacatg agcgcgggac 
gcgcccaccg aaaccgaatt ctcttggaac 



tcgccctact tccgcagcca cagtgcgcag 23340 
aaaaacatgt aaaaataatg tactagagac 23400 
tacactctcg ggtgattatt tacccccacc 23460 
ggcgacgggg acggggacga cacgtcctcc 23520 
ccgcgctcgg gggtggtttc gcgctgctcc 23580 
aggcagaaaa agatcatgga gtcagtcgag 23640 
ttcgccacca ccgcctccac cgatgccgcc 23700 
cccccgcttg aggaggagga agtgattatc 23760 
gacgaggacc gctcagtacc aacagaggat 23820 
aacgaggaac aagtcgggcg gggggacgaa 23880 
gacgtgctgt tgaagcatct gcagcgccag 23940 
cgcagcgatg tgcccctcgc catagcggat 24000 
tcaccgcgcg taccccccaa acgccaagaa 24060 
aacttctacc ccgtatttgc cgtgccagag 24120 
aactgcaaga tacccctatc ctgccgtgcc 24180 
ttgcggcagg gcgctgtcat acctgatatc 24240 
gagggtcttg gacgcgacga gaagcgcgcg 24300 
aatgaaagtc actctggagt gttggtggaa 24360 
ctaaaacgca gcatcgaggt cacccacttt 24420 
gtcatgagca cagtcatgag tgagctgatc 24480 
gcaaatttgc aagaacaaac agaggagggc 24540 
cgctggcttc aaacgcgcga gcctgccgac 24600 
gcagtgctcg ttaccgtgga gcttgagtgc 24660 
cagcgcaagc tagaggaaac attgcactac 24720 
tgcaagatct ccaacgtgga gctctgcaac 24780 
aaccgccttg ggcaaaacgt gcttcattcc 24840 
gtccgcgact gcgtttactt atttctatgc 24900 
cagcagtgct tggaggagtg caacctcaag 24960 
aaggacctat ggacggcctt caacgagcgc 25020 
ttccccgaac gcctgcttaa aaccctgcaa 25080 
atgttgcaga actttaggaa ctttatccta 25140 
tgtgcacttc ctagcgactt tgtgcccatt 25200 
ggccactgct accttctgca gctagccaac 25260 
gacgtgagcg gtgacggtct actggagtgt 25320 
cgctccctgg tttgcaattc gcagctgctt 25380 
ctgcagggtc cctcgcctga cgaaaagtcc 25440 
ctgtggacgt cggcttacct tcgcaaattt 25500 
aggttctacg aagaccaatc ccgcccgcca 25560 
cagggccaca ttcttggcca attgcaagcc 25620 
cgaaagggac ggggggttta cttggacccc 25680 
ccgccgccgc agccctatca gcagcagccg 25740 
aaagaagctg cagctgccgc cgccacccac 25800 
agaggaggtt ttggacgagg aggaggagga 25860 
ggaagcttcc gaggtcgaag aggtgtcaga 25920 
ctcgccggcg ccccagaaat cggcaaccgg 25980 
ggcgccgccg gcactgcccg ttcgccgacc 26040 
ggccggtaag tccaagcagc cgccgccgtt 26100 
ccgctcatgg cgcgggcaca agaacgccat 26160 
catctccttc gcccgccgct ttcttctcta 26220 
cctgcattac taccgtcatc tctacagccc 26280 
caacagcagc ggccacacag aagcaaaggc 26340 
agaaatccac agcggcggca gcagcaggag 26400 
ccgtatcgac ccgcgagctt agaaacagga 26460 
agagcagggg ccaagaacaa gagctgaaaa 26520 
gcagctgcct gtatcacaaa agcgaagatc 26580 
ctctcttcag taaatactgc gcgctgactc 26640 
tttaagcgcg aaaactacgt catctccagc 26700 
gcgccattat gagcaaggaa attcccacgc 26760 
gacttgcggc tggagctgcc caagactact 26820 
cccacatgat atcccgggtc aacggaatcc 26880 
aggcggctat taccaccaca cctcgtaata 26940 
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accttaatcc 
ctgtggtact 
ttgcgggcgg 
tcagagggcg 
cggacgggac 
tcctaactct 
ttattgagga 
atccggatca 
gaatgttaag 
acaagtgctt 
tcgagggccc 
ttcgggagtt 
ctgtgatttg 
ctgagtataa 
gccaccgtct 
tctccctctg 
tccgagctca 
agtgcgtcac 
acagacctca 
ttaggccaaa 
ttctaattca 
gacgcagggc 
tgcaccagtg 
gtaataccac 
tcatggtggg 
ttcactcacc 
tcaaagatct 
tcagttagca 
tggtattgca 
tcctcctgtt 
agaccgtctg 
actgtgcctt 
ggggtactct 
aaaatgggca 
actgtgagcc 
ctcacagtta 
aacacactca 
gccacccaag 
ctcaccacca 
actggtagct 
ctaaagtacg 
ggtccaggtg 
tttgattcac 
aacagacgcc 
agactaggac 
aaaggccttt 
actgccaagg 
gaatttggtt 
ctagaatttg 
agcacaggtg 
ccagctccat 
ttaacaaaat 
ttggctccaa 
ggagtgctac 
cttactgaag 
ccaaaatctc 
gacaaaacta 
acaactccaa 
aatgaaatat 
gtttgtgtta 
attcagtagt 



ccgtagttgg 
tcccagagac 
ctttcgtcac 
aggtattcag 
atttcagatc 
gcagacctcg 
gtttgtgcca 
atttattcct 
tggagaggca 
tgcccgcgac 
ggcgcacggc 
tacccagcgc 
caactgtcct 
taaatacaga 
tcacccgccc 
tgatttacaa 
gctactccat 
cggccgctgc 
ataactctgt 
ggcgcagcta 
ggtttctcta 
agcggccgag 
caaaaggggt 
cggacaccgc 
agaaaagccc 
ttgtcaagga 
tattcccttt 
aatttctgtc 
gcttcctcct 
cctgtccatc 
aagatacctt 
ttcttactcc 
ctttgcgcct 
acggcctctc 
cacctctcaa 
cctcagaagc 
ccatgcaatc 
gacccctcac 
ccgatagcag 
tgggcattga 
gggctccttt 
tgactattaa 
aaggcaatat 
ttatacttga 
agggccctct 
acttgtttac 
ggttgatgtt 
cacctaatgc 
attcaaacaa 
ccattacagt 
ctcctaactg 
gtggcagtca 
tatctggaac 
taaacaattc 
gcacagccta 
acggtaaaac 
aacctgtaac 
gtgcatactc 
ttgccacatc 
tgtttcaacg 
atagccccac 



cccgctgccc 
gcccaggccg 
agggtgcggt 
ctcaacgacg 

ggcggcgccg 

tcctctgagc 
tcggtctact 
aactttgacg 
gagcaactgc 
tccggtgagt 
gtccggctta 
cccctgctag 
aaccttggat 
aattaaaata 
aagcaaacca 
cagtttcaac 
cagaaaaaac 
accacaccta 
ttaccagaac 
ctgtggggtt 
gaaatggacg 
caacagcgca 
atcttttgtc 
cttagctaca 
attaccataa 
cctgaggatc 
aactaataaa 
cagtttattc 
ggctgcaaac 
cgcacccact 
caaccccgtg 
tccctttgta 
atccgaacct 
tctggacgag 
aaaaaccaag 
cctaactgtg 
acaggccccg 
agtgtcagaa 
tacccttact 
cttgaaagag 
gcatgtaaca 
taatacttcc 
gcaacttaat 
tgttagttat 
ttttataaac 
agcttcaaac 
tgacgctaca 
accaaacaca 
ggctatggtt 
aggaaacaaa 
tagactaaat 
aatacttgct 
agttcaaagt 
cttcctggac 
tacaaacgct 
tgccaaaagt 
actaaccatt 
tatgtcattt 
ctcttacact 
tgtttatttt 
caccacatag 



tggtgtacca 
aagttcagat 
cgcccgggca 
agtcggtgag 
gccgtccttc 
cgcgctctgg 
ttaacccctt 
cggtaaagga 
gcctgaaaca 
tttgctactt 
ccgcccaggg 
ttgagcggga 
tacatcaaga 
tactggggct 
aggcgaacct 
ccagacggag 
accaccctcc 
ccgcctgacc 
aggaggtgag 
tatgaacaat 
gaattattac 
tgaatcaaga 
tggtaaagca 
agttgccaac 
ctcagcactc 
tctgcaccct 
aaaaaataat 
agcagcacct 
tttctccaca 
atcttcatgt 
tatccatatg 
tcccccaatg 
ctagttacct 
gccggcaacc 
tcaaacataa 
gctgccgccg 
ctaaccgtgc 
ggaaagctag 
atcactgcct 
cccatttata 
gacgacctaa 
ttgcaaacta 
gtagcaggag 
ccgtttgatg 
tcagcccaca 
aattccaaaa 
gccatagcca 
aatcccctca 
cctaaactag 
aataatgata 
gcagagaaag 
acagtttcag 
gctcatctta 
ccagaatatt 
gttggattta 
aacattgtca 
acactaaacg 
tcatgggact 
ttttcataca 
tcaattgcag 
cttatacaga 



ggaaagtccc 
gactaactca 
gggtataact 
ctcctcgctt 
attcacgcct 
aggcattgga 
ctcgggacct 
ctcggcggac 
cctggtccac 
tgaattgccc 
agagcttgcc 
caggggaccc 
tctttgttgc 
cctatcgcca 
tacctggtac 
tgagtctacg 
ttacctgccg 
gtaaaccaga 
cttagaaaac 
tcaagcaact 
agagcagcgc 
gctccaagac 
ggccaaagtc 
caagcgtcag 
ggtagaaacc 
tattaagacc 
aaagcatcac 
ccttgccctc 
atctaaatgg 
tgttgcagat 
acacggaaac 
ggtttcaaga 
ccaatggcat 
ttacctccca 
acctggaaat 
cacctctaat 
acgactccaa 
ccctgcaaac 
caccccctct 
cacaaaatgg 
acactttgac 
aagttactgg 
gactaaggat 
ctcaaaacca 
acttggatat 
agcttgaggt 
ttaatgcagg 
aaacaaaaat 
gaactggcct 
agctaacttfc 
atgctaaact 
ttttggctgt 
ttataagatt 
ggaactttag 
tgcctaacct 
gtcaagttta 
gtacacagga 
ggtctggcca 
ttgcccaaga 
aaaatttcaa 
tcaccgtacc 



gctcccacca 
ggggcgcagc 
cacctgacaa 
ggtctccgtc 
cgtcaggcaa 
actctgcaat 
cccggccact 
ggctacgact 
tgtcgccgcc 
gaggatcata 
cgtagcctga 
tgtgttctca 
catctctgtg 
tcctgtaaac 
ttttaacatc 
agagaacctc 
ggaacgtacg 
ctttttccgg 
ccttagggta 
ctacgggcta 
ctgctagaaa 
atggttaact 
acctacgaca 
aaafctggtgg 
gaaggctgca 
ctgtgcggtc 
ttacttaaaa 
ctcccagctc 
aatgtcagtt 
gaagcgcgca 
cggtcctcca 
gagtccccct 
gcttgcgctc 
aaatgtaacc 
atctgcaccc 
ggtcgcgggc 
acttagcatt 
atcaggcccc 
aactactgcc 
aaaactagga 
cgtagcaact 
agccttgggt 
tgattctcaa 
actaaatcta 
taactacaac 
taacctaagc 
agatgggctt 
tggccatggc 
tagttttgac 
gtggaccaca 
cactttggtc 
taaaggcagt 
tgacgaaaat 
aaatggagat 
atcagcttat 
cttaaacgga 
aacaggagac 
caactacatt 
ataaagaatc 
gtcatttttc 
ttaatcaaac 



27000 
27060 
27120 
27180 
27240 
27300 
27360 
27420 
27480 
27540 
27600 
27660 
27720 
27780 
27840 
27900 
27960 
28020 
28080 
28140 
28200 
28260 
28320 
28380 
28440 
28500 
28560 
28620 
28680 
28740 
28800 
28860 
28920 
28980 
29040 
29100 
29160 
29220 
29280 
29340 
29400 
29460 
29520 
29580 
29640 
29700 
29760 
29820 
29880 
29940 
30000 
30060 
30120 
30180 
30240 
30300 
30360 
30420 
30480 
30540 
30600 
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tcacagaacc 

ttctccccgg 

tatattccac 

gggcagctca 

ttgcggttgc 

atcgtgcatc 

ccgctccgtc 

ccgcagcata 

agcacagtaa 

gtatccaaag 

gtagattaag 

gttgtaattc 

caccatccta 

actggaacaa 

gatatcaatg 

ctcccgcgtt 

actgcaggga 

cagcagcgga 

atccctactg 

aaatggaacg 

cagatctgcg 

actctctcaa 

ccgctgccct 

cgttctgcga 

tattccaaaa 

ccggtggcgt 

acaatggctt 

tcagggtgaa 

tctcgccacc 

aaaatctgct 

attcaggttc 

gatcccgtag 

gcgcggccac 

tactcggagc 

ataaaatgca 

tcgtagtcat 

accatttttc 

aaaacattta 

gacggactac 

ccaccgacag 

gtt gat teat 

aggegtagag 

aacacataaa 

acaacataca 

cctattaaaa 

gecaagtgea 

aaaacaccca 

acttcctcaa 

tacaattccc 

cacgccccgc 

ataaggtata 



ctagtattca 
ctggccttaa 
aeggtttect 
cttaagttca 
ttaacgggcg 
aggatagggc 
ctgcaggaat 
aggegecttg 
ctgcagcaca 
etcatggegg 
tggcgacccc 
accacctccc 
aaccagctgg 
tgacagtgga 
ttggcacaac 
agaaccatat 
agacctcgca 
tgatcctcca 
tacggagtgc 
ceggaegtag 
tctccggtct 
agcatccagg 
gataacatcc 
gtcacacacg 
gattatccaa 
ggtcaaactc 
ccaaaaggca 
tctcctctat 
ttctcaatat 
ccagagcgcc 
ctcacagacc 
gtcccttcgc 
ttccccgcca 
tatgetaace 
aggtgetget 
getcatgeag 
tctcaaacat 
aacattagaa 
ggccatgccg 
ctcctcggtc 
cggtcagtgc 
acaacattac 
cacctgaaaa 
gcgcttcaca 
aaacaccact 
gagegagtat 
gaaaaccgea 
ategtcaett 
aacacataca 
gccacgtcac 
ttattgatga 



acctgccacc 
aaagcatcat 
gtcgagccaa 
tgtcgctgtc 
gcgaaggaga 
ggtggtgctg 
acaacatggc 
tcctccgggc 
gcaccacaat 
ggaccacaga 
tcataaacac 
ggtaccatat 
ccaaaacctg 
gageccagga 
acaggcacac 
cccagggaac 
cgtaactcac 
gtatggtagc 
gecgagacaa 
tcatatttcc 
cgccgcttag 
cgccccctgg 
accaccgcag 
ggaggagegg 
aacctcaaaa 
tacagecaaa 
aacggccctc 
aaacattcca 
atctctaagc 
ctccaccttc 
tgtataagat 
agggecaget 
ggaaccttga 
agcgtagccc 
caaaaaatca 
ataaaggcag 
gtctgcgggt 
gectgtctta 
gcgtgaccgt 
atgtccggag 
taaaaagega 
agcccccata 
accctcctgc 
gcggcagcct 
cgacacggca 
atataggact 
cgcgaaccta 
ccgttttccc 
agttactccg 
aaactccacc 
tg 



tccctcccaa 
atcatgggta 
acgctcatca 
cagctgetga 
agtccacgcc 
cagcagcgcg 
agtggtctcc 
acagcagcgc 
attgttcaaa 
acccacgtgg 
gctggacata 
aaacctctga 
cccgccggct 
ctcgtaacca 
gtgeatacac 
aacccattcc 
gttgtgcatt 
gcgggtttct 
ccgagatcgt 
tgaagcaaaa 
ategctctgt 
ettegggtte 
aataagecac 
gaagagctgg 
tgaagatcta 
gaacagataa 
aegtccaagt 
gcaccttcaa 
aaatcccgaa 
agcctcaagc 
teaaaagegg 
gaacataatc 
caaaagaacc 
cgatgtaagc 
ggcaaagect 
gtaagctccg 
ttctgcataa 
caacaggaaa 
aaaaaaactg 
tcataatgta 
ccgaaatagc 
ggaggtataa 
ctaggcaaaa 
aacagtcagc 
ccagctcaat 
aaaaaatgac 
cgcccagaaa 
acgttacgta 
ccctaaaacc 
ccctcattat 



cacacagagt 
acagacatat 
gtgatattaa 
gccacaggct 
tacatggggg 
cgaataaact 
teagegatga 
accctgatct 
atcccacagt 
ccatcatacc 
aacattacct 
ttaaacatgg 
atacactgea 
tggatcatca 
ttcctcagga 
tgaatcagcg 
gtcaaagtgt 
gtctcaaaag 
gttggtcgta 
ecaggtgegg 
gtagtagttg 
tatgtaaact 
acccagccaa 
aagaaccatg 
ttaagtgaac 
tggcatttgt 
ggacgtaaag 
ccatgcccaa 
tattaagtcc 
agegaatcat 
aacattaaca 
gtgeaggtet 
cacactgatt 
tttgttgcat 
cgcgcaaaaa 
gaaccaccac 
acacaaaata 
aacaaccctt 
gtcaccgtga 
agacteggta 
ccgggggaat 
caaaattaat 
tagcaccctc 
cttaccagta 
cagtcacagt 
gtaacggtta 
egaaagecaa 
acttcccatt 
tacgtcaccc 
catattggct 



acacagtcct 
tcttaggtgt 
taaactcccc 
gctgtccaac 
tagagtcata 
gctgccgccg 
ttcgcaccgc 
cacttaaatc 
geaaggeget 
acaagegcag 
cttttggcat 
cgccatccac 
gggaaccggg 
tgetegtcat 
ttacaagctc 
taaatcccac 
tacatteggg 
gaggtagacg 
gtgtcatgcc 
gcgtgacaaa 
tagtatatcc 
ccttcatgcg 
cctacacatt 
tttttttttt 
gcgctcccct 
aagatgttgc 
gctaaaccct 
ataattctca 
ggccattgta 
gattgeaaaa 
aaaatacege 
gcacggacca 
atgacacgea 
gggeggegat 
agaaagcaca 
agaaaaagac 
aaataacaaa 
ataagcataa 
ttaaaaagca 
aacacatcag 
acatacccgc 
aggagagaaa 
ccgctccaga 
aaaaagaaaa 
gtaaaaaagg 
aagtccacaa 
aaaacccaca 
ttaagaaaac 
gccccgttcc 
tcaatccaaa 



30660 
30720 
30780 
30840 
30900 
30960 
31020 
31080 
31140 
31200 
31260 
31320 
31380 
31440 
31500 
31560 
31620 
31680 
31740 
31800 
31860 
31920 
31980 
32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32520 
32580 
32640 
32700 
32760 
32820 
32880 
32940 
33000 
33060 
33120 
33180 
33240 
33300 
33360 
33420 
33480 
33540 
33600 
33622 



<210> 45 
<211> 1746 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> 5F KOI 



<400> 45 
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atgaagcgcg 
accggtcctc 
gagagtcccc 
atgcttgcgc 
caaaatgtaa 
atatctgcac 
atggtcgcgg 
aaacttagca 
acatcaggcc 
ctaactactg 
ggaaaactag 
accgtagcaa 
ggagccttgg 
attgattctc 
caactaaatc 
attaactaca 
gttaacctaa 
ggagatgggc 
attggccatg 
cttagttttg 
ttgtggacca 
ctcactttgg 
gttaaaggca 
tttgacgaaa 
agaaatggag 
ctatcagctt 
tacttaaacg 
gaaacaggag 
cacaactaca 
gaataa 



caagaccgtc 
caactgtgcc 
ctggggtact 
tcaaaatggg 
ccactgtgag 
ccctcacagt 
gcaacacact 
ttgccaccca 
ccctcaccac 
ccactggtag 
gactaaagta 
ctggtccagg 
gttttgattc 
aaaacagacg 
taagactagg 
acaaaggcct 
gcactgccaa 
ttgaatttgg 
gcctagaatt 
acagcacagg 
caccagctcc 
tcttaacaaa 
gtttggctcc 
atggagtgct 
atcttactga 
atccaaaatc 
gagacaaaac 
acacaactcc 
ttaatgaaat 



tgaagatacc 
ttttcttact 
ctctttgcgc 
caacggcctc 
cccacctctc 
tacctcagaa 
caccatgcaa 
aggacccctc 
caccgatagc 
cttgggcatt 
cggggctcct 
tgtgactatt 
acaaggcaat 
ccttatactt 
acagggccct 
ttacttgttt 
ggggttgatg 
ttcacctaat 
tgattcaaac 
tgccattaca 
agaggctaac 
atgtggcagt 
aatatctgga 
actaaacaat 
aggcacagcc 
tcacggtaaa 
taaacctgta 
aagtgcatac 
atttgccaca 



ttcaaccccg 
cctccctttg 
ctatccgaac 
tctctggacg 
aaaaaaacca 
gccctaactg 
tcacaggccc 
acagtgtcag 
agtaccctta 
gacttgaaag 
ttgcatgtaa 
aataatactt 
atgcaactta 
gatgttagtt 
ctttttataa 
acagcttcaa 
tttgacgcta 
gcaccaaaca 
aaggctatgg 
gtaggaaaca 
tgtagactaa 
caaatacttg 
acagttcaaa 
tccttcctgg 
tatacaaacg 
actgccaaaa 
acactaacca 
tctatgtcat 
tcctcttaca 



tgtatccata 
tatcccccaa 
ctctagttac 
aggccggcaa 
agtcaaacat 
tggctgccgc 
cgctaaccgt 
aaggaaagct 
ctatcactgc 
agcccattta 
cagacgacct 
ccttgcaaac 
atgtagcagg 
atccgtttga 
actcagccca 
acaattccaa 
cagccatagc 
caaatcccct 
ttcctaaact 
aaaataatga 
atgcagagaa 
ctacagtttc 
gtgctcatct 
acccagaata 
ctgttggatt 
gtaacattgt 
ttacactaaa 
tttcatggga 
ctttttcata 



tgacacggaa 
tgggtttcaa 
ctccaatggc 
ccttacctcc 
aaacctggaa 
cgcacctcta 
gcacgactcc 
agccctgcaa 
ctcaccccct 
tacacaaaat 
aaacactttg 
taaagttact 
aggactaagg 
tgctcaaaac 
caacttggat 
aaagcttgag 
cattaatgca 
caaaacaaaa 
aggaactggc 
taagctaact 
agatgctaaa 
agttttggct 
tattataaga 
ttggaacttt 
tatgcctaac 
cagtcaagtt 
cggtacacag 
ctggtctggc 
cattgcccaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1746 



<210> 46 
<211> 581 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F KOI 



<400> 46 
Met Lys Arg 
1 

Tyr Asp Thr 

Phe Val Ser 
35 

Leu Arg Leu 
50 

Lys Met Gly 
65 

Gin Asn Val 

lie Asn Leu 

Thr Val Ala 
115 

Met Gin Ser 

130 
Ala Thr Gin 
145 

Thr Ser Gly 



Ala Arg Pro 
5 

Glu Thr Gly 
20 

Pro Asn Gly 

Ser Glu Pro 

Asn Gly Leu 
70 

Thr Thr Val 
85 

Glu lie Ser 
100 

Ala Ala Ala 

Gin Ala Pro 

Gly Pro Leu 
150 

Pro Leu Thr 



Ser Glu Asp 

Pro Pro Thr 
25 

Phe Gin Glu 
40 

Leu Val Thr 
55 

Ser Leu Asp 

Ser Pro Pro 

Ala Pro Leu 
105 

Pro Leu Met 

120 
Leu Thr Val 
135 

Thr Val Ser 
Thr Thr Asp 



Thr Phe 
10 

Val Pro 

Ser Pro 

Ser Asn 

Glu Ala 

75 
Leu Lys 
90 

Thr Val 

Val Ala 

His Asp 

Glu Gly 
155 
Ser Ser 



Asn 

Phe 

Pro 

Gly 

60 

Gly 

Lys 

Thr 

Gly 

Ser 
140 
Lys 

Thr 



Pro 

Leu 

Gly 

45 

Met 

Asn 

Thr 

Ser 

Asn 
125 
Lys 

Leu 

Leu 



Val Tyr 

15 
Thr Pro 
30 

Val Leu 

Leu Ala 

Leu Thr 

Lys Ser 

95 
Glu Ala 
110 

Thr Leu 
Leu Ser 
Ala Leu 
Thr lie 



Pro 

Pro 

Ser 

Leu 

Ser 

80 

Asn 

Leu 

Thr 

lie 

Gin 
160 
Thr 



WO 03/062400 



PCT/US03/02295 



-35- 



165 170 175 

Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly He Asp Leu 

180 185 190 

Lvs Glu Pro He Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 

195 200 205 

Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 

210 215 220 

Glv Pro Gly Val Thr He Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
225 230 235 240 

Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 

245 250 255 

Glv Glv Leu Arg He Asp Ser Gin Asn Arg Arg Leu He Leu Asp Val 

260 265 270 

Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 

275 280 285 

Glv Pro Leu Phe He Asn Ser Ala His Asn Leu Asp He Asn Tyr Asn 

290 295 300 

Lvs Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
305 3X0 315 

Val Asn Leu Ser Thr Ala Lys Gly 'Leu Met Phe Asp Ala Thr Ala He 

325 330 33 5 

Ala He Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 

340 345 350 

Asn Thr Asn Pro Leu Lys Thr Lys He Gly His Gly Leu Glu Phe Asp 

355 360 365 

Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 

370 375 380 

Ser Thr Gly Ala He Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 390 395 

Leu Trp Thr Thr Pro Ala Pro Glu Ala Asn Cys Arg Leu Asn Ala Glu 

405 410 415 

Lvs Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin He 

420 425 430 

Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro He 

435 440 445 

Ser Gly Thr Val Gin Ser Ala His Leu He He Arg Phe Asp Glu Asn 

450 455 460 

Glv Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe 
465 470 475 480 

Atq Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly 

485 490 495 

Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala 

500 505 510 

Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr Lys 

515 520 525 

Pro Val Thr Leu Thr He Thr Leu Asn Gly Thr Gin Glu Thr Gly Asp 

530 535 540 

Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser Gly 
545 550 555 560 

His Asn Tyr He Asn Glu He Phe Ala Thr Ser Ser Tyr Thr Phe Ser 

565 570 575 

Tyr He Ala Gin Glu 
580 



<210> 47 
<211> 1776 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> 5F K01RGD 



<400> 47 

atgaagcgcg 

accggtcctc 

gagagtcccc 

atgcttgcgc 

caaaatgtaa 

atatctgcac 

atggtcgcgg 

aaacttagca 

acatcaggcc 

ctaactactg 

ggaaaactag 

accgtagcaa 

ggagccttgg 

attgattctc 

caactaaatc 

attaactaca 

gttaacctaa 

ggagatgggc 

attggccatg 

cttagttttg 

ttgtggacca 

ctcactttgg 

gttaaaggca 

tttgacgaaa 

agaaatggag 

ctatcagctt 

tacttaaacg 

gaaacaggtg 

tctatgtcat 

tcctcttaca 



caagaccgtc 
caactgtgcc 
ctggggtact 
tcaaaatggg 
ccactgtgag 
ccctcacagt 
gcaacacact 
ttgccaccca 
ccctcaccac 
ccactggtag 
gactaaagta 
ctggtccagg 
gttttgattc 
aaaacagacg 
taagactagg 
acaaaggcct 
gcactgccaa 
ttgaatttgg 
gcctagaatt 
acagcacagg 
caccagctcc 
tcttaacaaa 
gtttggctcc 
atggagtgct 
atcttactga 
atccaaaatc 
gagacaaaac 
atcattgtga 
tttcatggga 
ctttttcata 



tgaagatacc 
ttttcttact- 
ctctttgcgc 
caacggcctc 
cccacctctc 
tacctcagaa 
caccatgcaa 
aggacccctc 
caccgatagc 
cttgggcatt 
cggggctcct 
tgtgactatt 
acaaggcaat 
ccttatactt 
acagggccct 
ttacttgttt 
ggggttgatg 
ttcacctaat 
tgattcaaac 
tgccattaca 
atctcctaac 
atgtggcagt 
aatatctgga 
actaaacaat 
aggcacagcc 
tcacggtaaa 
taaacctgta 
ttgtcgtggt 
ctggtctggc 
cattgcccaa 



ttcaaccccg 
cctccctttg 
ctatccgaac 
tctctggacg 
aaaaaaacca 
gccctaactg 
tcacaggccc 
acagtgtcag 
agtaccctta 
, gacttgaaag 
ttgcatgtaa 
aataatactt 
atgcaactta 
gatgttagtt 
ctttttataa 
acagcttcaa 
tttgacgcta 
gcaccaaaca 
aaggctatgg 
gtaggaaaca 
tgtagactaa 
caaatacttg 
acagttcaaa 
tccttcctgg 
tatacaaacg 
actgccaaaa 
acactaacca 
gattgttttt 
cacaactaca 
gaataa 



tgtatccata 
tatcccccaa 
ctctagttac 
aggccggcaa 
agtcaaacat 
tggctgccgc 
cgctaaccgt 
aaggaaagct 
ctatcactgc 
agcccattta 
cagacgacct 
ccttgcaaac 
atgtagcagg 
atccgtttga 
actcagccca 
acaattccaa 
cagccatagc 
caaatcccct 
ttcctaaact 
aaaataatga 
atgcagagaa 
ctacagtttc 
gtgctcatct 
acccagaata 
ctgttggatt 
gtaacattgt 
ttacactaaa 
gtacaactcc 
ttaatgaaat 



tgacacggaa 
tgggtttcaa 
ctccaatggc 
ccttacctcc 
aaacctggaa 
cgcacctcta 
gcacgactcc 
agccctgcaa 
ctcaccccct 
tacacaaaat 
aaacactttg 
taaagttact 
aggactaagg 
tgctcaaaac 
caacttggat 
aaagcttgag 
cattaatgca 
caaaacaaaa 
aggaactggc 
taagctaact 
agatgctaaa 
agttttggct 
tattataaga 
ttggaacttt 
tatgcctaac 
cagtcaagtt 
cggtacacag 
aagtgcatac 
atttgccaca 



60 

120 

180 

240 

300 

360 

420 . 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1776 



<210> 48 
<211> 591 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> 5F KOIRGD 



<400> 48 




Met Lys 


Arg 


Ala 


1 






Tyr Asp 


Thr 


Glu 






20 


Phe Val 


Ser 


Pro 




35 




Leu Arg 


Leu 


Ser 


50 






Lys Met 


Gly 


Asn 


65 






Gin Asn 


Val 


Thr 


lie Asn 


Leu 


Glu 






100 


Thr Val 


Ala 


Ala 




115 




Met Gin 


Ser 


Gin 



130 



Arg 


Pro 


Ser 


Glu 


5 








Thr 


Gly 


Pro 


Pro 


Asn 


Gly 


Phe 


Gin 








40 


Glu 


Pro 


Leu 


Val 






55 




Gly 


Leu 


Ser 


Leu 




70 






Thr 


Val 


Ser 


Pro 


85 








lie 


Ser 


Ala 


Pro 


Ala 


Ala 


Pro 


Leu 








120 


Ala 


Pro 


Leu 


Thr 






135 





Asp 


Thr 


Phe Asn 




10 




Thr 


Val 


Pro Phe 


25 






Glu 


Ser 


Pro Pro 


Thr 


Ser 


Asn Gly 






60 


Asp 


Glu 


Ala Gly 






75 


Pro 


Leu 


Lys Lys 




90 




Leu 


Thr 


Val Thr 


105 






Met 


Val 


Ala Gly 


Val 


His 


Asp Ser 






140 



Pro 


Val 


Tyr 


Pro 






15 




Leu 


Thr 


Pro 


Pro 




30 






Gly 


Val 


Leu 


Ser 


45 








Met 


Leu 


Ala 


Leu 


Asn 


Leu 


Thr 


Ser 








80 


Thr 


Lys 


Ser 


Asn 






95 




Ser 


Glu 


Ala 


Leu 




110 






Asn 


Thr 


Leu 


Thr 


125 








Lys 


Leu 


Ser 


lie 
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Ala Thr 


Gin 


Glv 


Pro 


Leu 


145 








150 


Thr Ser Gly 


Pro 


Leu 


Thr 








165 




Ala Ser 


pro 


Pro 


Leu 


Thr 






180 






Lys Glu 


JTX 


He 


Tyr 


Thr 










Ala jb*ro 


Leu 


His 


Val 


Thr 












laJ_y riU 




Val 


Thr 


He 


225 








230 


Gly Ala 


Leu 


Glv 


Phe 


Asp 






245 




Gly Gly 


Leu 


Arcr 


He 


Asp 




260 






Ser Tyr 


Pro 


Phe 


Asp 


Ala 


275 








Gly Pro 


Leu 


Phe 


He 


Asn 


290 








Phe 


Lys Gly 


Leu 


Tvr 


Leu 


305 








310 


Val Asn 


Leu 


Ser 


Thr 


Ala 








325 




Ala He 


Asn 


Ala 


Gly 


Asp 






340 






Asn Thr 


Asn 


Pro 


Leu 


Lys 




355 








Ser Asn 


Lys 


Ala 


Met 


Val 


370 










Ser Thr Gly 


Ala 


He 


Thr 


385 








390 


Leu Trp 


Thr 


Thr 


Pro 


Ala 






405 




Lys Asp 


Ala 


Lys 


Leu 


Thr 






420 






Leu Ala 


Thr 


Val 


Ser 


Val 




435 








Ser Gly Thr 


Val 


Gin 


Ser 


450 










Gly Val 


Leu 


Leu 


Asn 


Asn 


465 








470 


Arg Asn Gly 


Asp 


Leu 


Thr 








485 




Phe Met 


Pro 


Asn 


Leu 


Ser 






500 






.Lys Ser 


Asn 


He 


Val 


Ser 


515 








Pro Val 


Thr 


Leu 


Thr 


He 


530 








Gly 


His Cys 


Asp 


Cys 


Arg 


545 








550 


Ser Met 


Ser 


Phe 


Ser 


Trp 








565 




He Phe 


Ala 


Thr 


Ser 


Ser 



155 160 
Thr Thr Asp Ser Ser Thr Leu Thr He Thr 

170 175 
Thr Ala Thr Gly Ser Leu Gly He Asp Leu 

185 190 
Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 

200 205 
Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 
215 220 

Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
235. 240 
Ser Gin Gly Asn Met Gin Leu Asn Val Ala 

250 255 
Ser Gin Asn Arg Arg Leu He Leu Asp Val 

265 270 
Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 

280 285 
Ser Ala His Asn Leu Asp He Asn Tyr Asn 
295 * 300 

Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
315 . 320 
Lys Gly Leu Met Phe Asp Ala Thr Ala He 

330 335 
Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 

345 350 
Thr Lys He Gly His Gly Leu Glu Phe Asp 

360 365 
Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 
375 380 

Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
395 400 
Pro Glu Ala Asn Cys Arg Leu Asn Ala Glu 

410 415 
Leu Val Leu Thr Lys Cys Gly Ser Gin He 

425 430 
Leu Ala Val Lys Gly Ser Leu Ala Pro He 

440 445 
Ala His Leu He He Arg Phe Asp Glu Asn 
455 , 460 

Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe 
475 480 
Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly 

490 495 
Ala Tyr Pro Lys Ser His Gly Lys Thr Ala 

505 510 
Gin Val Tyr Leu Asn Gly Asp Lys Thr Lys 

520 525 
Thr Leu Asn Gly Thr Gin Glu Thr Gly Asp 
535 540 

Asp Cys Phe Cys Thr Thr Pro Ser Ala Tyr 
555 560 
Asp Trp Ser Gly His Asn Tyr He Asn Glu 

570 575 
Tyr Thr Phe Ser Tyr He Ala Gin Glu 
580 585 590 



<210> 49 
<211> 1746 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> 5F KO!2 
<400> 49 

atgaagcgcg caagaccgtc tgaagatacc ttcaaccccg tgtatccata tgacacggaa 60 
accggtcctc caactgtgcc ttttcttact cctccctttg tatcccccaa tgggtttcaa 120 
gagagtcccc ctggggtact ctctttgcgc ctatccgaac ctctagttac ctccaatggc 180 
atgcttgcgc tcaaaatggg caacggcctc tctctggacg aggccggcaa ccttacctcc 240 
caaaatgtaa ccactgtgag cccacctctc aaaaaaacca agtcaaacat aaacctggaa 3 00 
atatctgcac ccctcacagt tacctcagaa gccctaactg tggctgccgc cgcacctcta 360 
atggtcgcgg gcaacacact caccatgcaa tcacaggccc cgctaaccgt gcacgactcc 420 
aaacttagca ttgccaccca aggacccctc acagtgtcag aaggaaagct agccctgcaa 480 
acatcaggcc ccctcaccac caccgatagc agtaccctta ctatcactgc ctcaccccct 540 
ctaactactg ccactggtag cttgggcatt gacttgaaag agcccattta tacacaaaat 600 
ggaaaactag gactaaagta cggggctcct ttgcatgtaa cagacgacct aaacactttg 660 
accgtagcaa ctggtccagg tgtgactatt aataatactt ccttgcaaac taaagttact 720 
ggagccttgg gttttgattc acaaggcaat atgcaactta atgtagcagg aggactaagg 780 
attgattctc aaaacagacg ccttatactt gatgttagtt atccgtttga tgctcaaaac 840 
caactaaatc taagactagg acagggccct ctttttataa actcagccca caacttggat 900 
attaactaca acaaaggcct ttacttgttt acagcttcaa acaattccaa aaagcttgag 960 
gttaacctaa gcactgccaa ggggttgatg tttgacgcta cagccatagc cattaatgca 1020 
ggagatgggc ttgaatttgg ttcacctaat gcaccaaaca caaatcccct caaaacaaaa 1080 
attggccatg gcctagaatt tgattcaaac aaggctatgg ttcctaaact aggaactggc 1140 
cttagttttg acagcacagg tgccattaca gtaggaaaca aaaataatga taagctaact 1200 
ttgtggacca caccagctcc atctcctaac tgttcactaa atggaggcgg agatgctaaa 1260 
ctcactttgg tcttaacaaa atgtggcagt caaatacttg ctacagtttc agttttggct 1320 
gttaaaggca gtttggctcc aatatctgga acagttcaaa gtgctcatct tattataaga 1380 
tttgacgaaa atggagtgct actaaacaat tccttcctgg acccagaata ttggaacttt 1440 
agaaatggag atcttactga aggcacagcc tatacaaacg ctgttggatt tatgcctaac 1500 
ctatcagctt atccaaaatc tcacggtaaa actgccaaaa gtaacattgt cagtcaagtt 1560 
tacttaaacg gagacaaaac taaacctgta acactaacca ttacactaaa cggtacacag 1620 
gaaacaggag acacaactcc aagtgcatac tctatgtcat tttcatggga ctggtctggc 1680 
cacaactaca ttaatgaaat atttgccaca tcctcttaca ctttttcata cattgcccaa 1740 
gaataa 1746 

<210> 50 
<211> 581 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F K012 



<400> 50 



Met 


Lys 


Arg 


Ala 


Arg 


Pro 


Ser 


Glu 


Asp 


Thr 


Phe Asn 


Pro 


Val 


Tyr 


Pro 


1 




5 










10 








15 




Tyx 


Asp 


Thr 


Glu 


Thr 


Gly 


Pro 


Pro 


Thr 


Val 


Pro Phe 


Leu 


Thr 


Pro 


Pro 




20 








i 


25 








30 






Phe 


Val 


Ser 
35 


Pro 


Asn 


Gly 


Phe 


Gin 
40 


Glu" 


Ser 


Pro Pro 


Gly 
45 


Val 


Leu 


Ser 


Leu 


Arg 
50 


Leu 


Ser 


Glu 


Pro 


Leu 
55 


Val 


Thr 


Ser 


Asn Gly 
60 


Met 


Leu 


Ala 


Leu 


Lys 


Met 


Gly 


Asn 


Gly 


Leu 


Ser 


Leu 


Asp 


Glu 


Ala Gly 


Asn 


Leu 


Thr 


Ser 


65 








70 










75 








80 


Gin 


Asn 


Val 


Thr 


Thr 
85 


Val 


Ser 


Pro 


Pro 


Leu 
90 


Lys Lys 


Thr 


Lys 


Ser 
95 


Asn 


He 


Asn 


Leu 


Glu 
100 


He 


Ser 


Ala 


Pro 


Leu 
105 


Thr 


Val Thr 


Ser 


Glu 
110 


Ala 


Leu 


Thr 


Val 


Ala 


Ala 


Ala 


Ala 


Pro 


Leu 


Met 


Val 


Ala Gly 


Asn 


Thr 


Leu 


Thr 
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115 120 125 

Met Gin Ser Gin Ala Pro Leu Thr Val His Asp Ser Lys Leu Ser lie 

130 135 140 

Ala Thr Gin Gly Pro Leu Thr Val Ser Glu Gly Lys Leu Ala Leu Gin 
145 150 155 160 

Thr Ser Gly Pro Leu Thr Thr Thr Asp Ser Ser Thr Leu Thr lie Thr 

165 170 175 

Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly He Asp Leu 

180 185 190 

Lys Glu Pro He Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 

195 200 205 

Ala Pro Leu. His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 

210 215 220 

Gly Pro Gly Val Thr He Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
225 230 235 240 

Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 

245 250 255 

Gly Gly Leu Arg He Asp Ser Gin Asn Arg Arg Leu lie Leu Asp Val. 

260 265 270 

Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 

275 280 285 

Gly Pro Leu Phe He Asn Ser Ala His Asn Leu Asp He Asn Tyr Asn 

290 295 300 

Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
305 310 315 320 

Val Asn Leu Ser Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala He 

325 330 335 

Ala- He Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 

340 345 350 

Asn Thr Asn Pro Leu Lys Thr Lys He Gly His* Gly Leu Glu Phe Asp 

355 360 365 

Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 

370 375 380 

Ser Thr Gly Ala He Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 390 395 400 

Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys Ser Leu Asn Gly Gly 

405 410 415 

Gly Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin He 

420 425 430 

Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro He 

435 440 445 

Ser Gly Thr Val Gin Ser Ala His Leu He He Arg Phe Asp Glu Asn 

450 455 460 

Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe 
465 470 475 480 

Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly 

485 490 495 

Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala 

500 505 510 

Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr Lys 

515 520 525 

Pro Val Thr Leu Thr He Thr Leu Asn Gly Thx Gin Glu Thr Gly Asp 

530 535 540 

Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser Gly 
545 550 555 560 

His Asn Tyr He Asn Glu He Phe Ala Thr Ser Ser Tyr Thx Phe Ser 

565 570 575 

Tyr He Ala Gin Glu 
580 
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<210> 51 
<211> 1746 ' 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> 5F S* 



<400> 51 

atgaagcgcg caagaccgtc tgaagatacc 
accggtcctc caactgtgcc ttttcttact 
gagagtcccc ctggggtact ctctttgcgc 
atgcttgcgc tcaaaatggg caacggcctc 
caaaatgtaa ccactgtgag cccacctctc 
atatctgcac ccctcacagt tacctcagaa 
atggtcgcgg gcaacacact caccatgcaa 
aaacttagca ttgccaccca aggacccctc 
acatcaggcc ccctcaccac caccgatagc 
ctaactactg ccactggtag cttgggcatt 
ggaaaactag gactaaagta cggggctcct 
accgtagcaa ctggtccagg tgtgactatt 
ggagccttgg gttttgattc acaaggcaat 
attgattctc aaaacagacg ccttatactt 
caactaaatc taagactagg acagggccct 
attaactaca acaaaggcct ttacttgttt 
gttaacctaa gcactgccaa ggggttgatg 
ggagatgggc ttgaatttgg ttcacctaat 
attggccatg gcctagaatt tgattcaaac 
cttagttttg acagcacagg tgccattaca 
ttgtggacca caccagctcc atctcctaac 
ctcactttgg tcttaacaaa atgtggcagt 
gttaaaggca gtttggctcc aatatctgga 
tttgacgaaa atggagtgct actaaacaat 
agaaatggag atcttactga aggcacagcc 
ctatcagctt atccaaaatc tcacggtaaa 
tacttaaacg gagacaaaac taaacctgta 
gaaacaggag acacaactcc aagtgcatac 
cacaactaca ttaatgaaat atttgccaca 
gaataa 

<210> 52 
<211> 581 
<212> PRT 

<213> Artificial Sequence 



ttcaaccccg tgtatccata tgacacggaa 60 • 
cctccctttg tatcccccaa tgggtttcaa 120 
ctatccgaac ctctagttac ctccaatggc 180 
tctctggacg aggccggcaa ccttacctcc 240 
ggagccggag cctcaaacat aaacctggaa 300 
gccctaactg tggctgccgc cgcacctcta 360 
tcacaggccc cgctaaccgt gcacgactcc '420 
acagtgtcag aaggaaagct agccctgcaa 480 
agtaccctta ctatcactgc ctcaccccct 540 
gacttgaaag agcccattta tacacaaaat 600 
ttgcatgtaa cagacgacct aaacactttg 660 
aataatactt ccttgcaaac taaagttact 720 
atgcaactta atgtagcagg aggactaagg 780 
gatgttagtt atccgtttga tgctcaaaac 840 
ctttttataa actcagccca caacttggat 900 
acagcttcaa acaattccaa aaagcttgag 960 
tttgacgcta cagccatagc cattaatgca 1020 
gcaccaaaca caaatcccct caaaacaaaa 1080 
aaggctatgg ttcctaaact aggaactggc 1140 
gtaggaaaca aaaataatga taagctaact 1200 
tgtagactaa atgcagagaa agatgctaaa 1260 
caaatacttg ctacagtttc agttttggct 1320 
acagttcaaa gtgctcatct tattataaga 1380 
tccttcctgg acccagaata ttggaacttt 1440 
tatacaaacg ctgttggatt tatgcctaac 1500 
actgccaaaa gtaacattgt cagtcaagtt 1560 
acactaacca ttacactaaa cggtacacag 1620 
tctatgtcat tttcatggga ctggtctggc 1680 
tcctcttaca ctttttcata cattgcccaa 1740 

1746 



<220> 

<223> 5F S* 



<400> 52 






Met Lys Arg 


Ala 


Arg 


1 




5 


Tyr Asp Thr 


Glu 


Thr 




20 




Phe Val Ser 


Pro 


Asn 


35 






Leu Arg Leu 


Ser 


Glu 


50 






Lys Met Gly 


Asn 


Gly 


65 






Gin Asn Val 


Thr 


Thr 






85 



Pro Ser Glu Asp Thr Phe 
10 

Gly Pro Pro Thr Val Pro 
25 

Gly Phe Gin Glu Ser Pro 
40 

Pro Leu Val Thr Ser Asn 
55 

Leu Ser Leu Asp Glu Ala 
70 75 
Val Ser Pro Pro Leu Gly 
90 



Asn Pro Val Tyr Pro 
15 

Phe Leu Thr Pro Pro 
30 

Pro Gly Val Leu Ser 
45 

Gly Met Leu Ala Leu 
60 

Gly Asn Leu Thr Ser 
80 

Ala Gly Ala Ser Asn 
95 
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Ile Asn Leu Glu lie Ser Ala Pro Leu Thr Val Thr Ser Glu Ala Leu 

100 105 110 

Thr Val Ala Ala Ala Ala Pro Leu Met Val Ala Gly Asn Thr Leu Thr 

115 120 125 

Met Gin Ser Gin Ala Pro Leu Thr Val His Asp Ser Lys Leu Ser lie 

130 135 140 

Ala Thr Gin Gly Pro Leu Thr Val Ser Glu Gly Lys Leu Ala Leu Gin 
145 150 155 160 

Thr Ser Gly Pro Leu Thr Thr Thr Asp Ser Ser Thr Leu Thr lie Thr 

165 170 175 

Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly lie Asp Leu 

180 185 190 

Lys Glu Pro lie Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 

195 200 205 

Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 

210 215 220 

Gly Pro Gly Val Thr lie Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
225 230 235 240 

Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 

245 250 255 

Gly Gly Leu Arg lie Asp Ser Gin Asn Arg Arg Leu He Leu Asp Val 

260 265 270 

Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 

275 280 285 

Gly Pro Leu Phe He Asn Ser Ala His Asn Leu Asp He Asn Tyr Asn 

290 295 300 

Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
305 310 315 320 

Val Asn Leu- Ser Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala He 

325 330 335 

Ala He Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 

340 345 350 

Asn Thr Asn Pro Leu Lys Thr Lys He Gly His Gly Leu Glu Phe Asp 

355 360 365 

Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 

370 375 380 

Ser Thr Gly Ala He Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 390 395 400 

Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys Arg Leu Asn Ala Glu 

405 410 415 

Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin He 

420 425 430 

Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro He 

435 440 445 

Ser Gly Thr Val Gin Ser Ala His Leu He He Arg Phe Asp Glu Asn 

450 455 460 

Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe 
465 470 475 480 

Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly 

485 490 495 

Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala 

500 505 510 

Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr Lys 

515 520 525 

Pro Val Thr Leu Thr He Thr Leu Asn Gly Thr Gin Glu Thr Gly Asp 

530 535 540 

Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser Trp Asp Trp Ser Gly 
545 550 555 560 

His Asn ,Tyr He Asn Glu He Phe Ala Thr Ser Ser Tyr Thr Phe Ser 

565 570 575 

Tyr He Ala Gin Glu 
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580 



<210> 53 

<211> 1776 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> 5F S*RGD 



<400> 53 

atgaagcgcg caagaccgtc tgaagatacc 

accggtcctc caactgtgcc ttttcttact 

gagagtcccc ctggggtact ctctttgcgc 

atgcttgcgc tcaaaatggg caacggcctc 

caaaatgtaa ccactgtgag cccacctctc 

atatctgcac ccctcacagt tacctcagaa 

atggtcgcgg gcaacacact caccatgcaa 

aaacttagca ttgccaccca aggacccctc 

acatcaggcc ccctcaccac caccgatagc 

ctaactactg ccactggtag cttgggcatt 

ggaaaactag gactaaagta cggggctcct 

accgtagcaa ctggtccagg tgtgactatt 

ggagccttgg gttttgattc acaaggcaat 

attgattctc aaaacagacg ccttatactt 

caactaaatc taagactagg acagggccct 

attaactaca acaaaggcct ttacttgttt 

gttaacctaa gcactgccaa ggggttgatg 

ggagatgggc ttgaatttgg ttcacctaat 

attggccatg gcctagaatt tgattcaaac 

cttagttttg acagcacagg tgccattaca 

ttgtggacca caccagctcc atctcctaac 

ctcactttgg tcttaacaaa atgtggcagt 

gttaaaggca gtttggctcc aatatctgga 

tttgacgaaa atggagtgct actaaacaat 

agaaatggag atcttactga aggcacagcc 

ctatcagctt atccaaaatc tcacggtaaa 

tacttaaacg gagacaaaac taaacctgta 

gaaacaggtg atcattgtga ttgtcgtggt 

tctatgtcat tttcatggga ctggtctggc 

tcctcttaca ctttttcata cattgcccaa 

<210> 54 
<211> 591 
<212> PRT 

<213> Artificial Sequence 



ttcaaccccg tgtatccata tgacacggaa 60 
cctccctttg tatcccccaa tgggtttcaa 120 
ctatccgaac ctctagttac ctccaatggc 180 
tctctggacg aggccggcaa ccttacctcc 240 
ggagccggag cctcaaacat aaacctggaa 300 
gccctaactg tggctgccgc cgcacctcta 360 
tcacaggccc cgctaaccgt gcacgactcc 420 
acagtgtcag aaggaaagct agccctgcaa 480 
agtaccctta ctatcactgc ctcaccccct 540 
gacttgaaag agcccattta tacacaaaat 600 
ttgcatgtaa cagacgacct aaacactttg 660 
aataatactt ccttgcaaac taaagttact 720 
atgcaactta atgtagcagg aggactaagg 780 
gatgttagtt atccgtttga • tgctcaaaac 840 
ctttttataa actcagccca caacttggat 900 
acagcttcaa acaattccaa aaagcttgag 960 
tttgacgcta cagccatagc cattaatgca 1020 
gcaccaaaca caaatcccct caaaacaaaa 10 80 
aaggctatgg ttcctaaact aggaactggc 1140 
gtaggaaaca aaaataatga taagctaact 12 00 
tgtagactaa atgcagagaa agatgctaaa 1260 
caaatacttg ctacagtttc agttttggct 1320 
acagttcaaa gtgctcatct tattataaga 1380 
tccttcctgg acccagaata ttggaacttt 1440 
tatacaaacg ctgttggatt tatgcctaac 1500 
actgccaaaa gtaacattgt cagtcaagtt 1560 
acactaacca ttacactaaa cggtacacag 1620 
gattgttttt gtacaactcc aagtgcatac 1680 
cacaactaca ttaatgaaat atttgccaca 1740 
gaataa 1776 



<220> 

<223> 5F S*RGD 



<400> 54 

Met Lys Arg Ala Arg Pro Ser Glu 

1 5 
Tyr Asp Thr Glu Thr Gly Pro Pro 
20 

p ne val Ser Pro Asn Gly Phe Gin 

35 40 
Leu Arg Leu Ser Glu Pro Leu Val 

50 55 
Lys Met Gly Asn Gly Leu Ser Leu 



Asp Thr Phe Asn Pro Val Tyr Pro 

10 15 
Thr Val Pro Phe Leu Thr Pro Pro 
25 30 
Glu Ser Pro Pro Gly Val Leu Ser 
45 

Thr Ser Asn Gly Met Leu Ala Leu 
60 

Asp Glu Ala Gly Asn Leu Thr Ser 
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65 






70 




75 








80 


Gin Asn Val 


Thr 


Thr 


Val 


Ser Pro Pro Leu Gly Ala Gly 


Ala 


Ser 


Asn 






85 




90 








95 




lie Asn Leu 


Glu 


lie 


Ser 


Ala Pro Leu Thr 


Val Thr 


Ser 


Glu 


Ala 


Leu 




100 






105 






110 






Thr Val Ala 


Ala 


Ala 


Ala 


Pro Leu Met Val Ala Gly Asn 


Thr 


Leu 


Thr 


115 








120 




125 








Met Gin Ser 


Gin 


Ala 


Pro 


Leu Thr Val His 


Asp Ser 


Lys 


Leu 


Ser 


He 


130 








135 


140 










Ala Thr Gin 


Gly 


Pro 


Leu 


Thr Val Ser Glu 


Gly Lys 


Leu 


Ala 


Leu 


Gin 


145 






150 




155 








160 


Thr Ser Gly 


Pro 


Leu 


Thr 


Thr Thr Asp Ser 


Ser Thr 


Leu 


Thr 


He 


Thr 




165 




170 








175 




Ala Ser Pro 


Pro 


Leu Thr Thr Ala Thr Gly 


Ser Leu Gly 


He Asp 


Leu 




180 






185 






190 






Lys Glu Pro 


lie 


Tyr Thr Gin Asn Gly Lys 


Leu Gly 


Leu 


Lys 


Tyr 


Gly 


195 








200 




205 








Ala Pro Leu 


His 


Val 


Thr Asp Asp Leu Asn 


Thr Leu 


Thr 


Val 


Ala 


Thr 


210 








215 


220 










Gly Pro Gly Val 


Thr 


He 


Asn Asn Thr Ser 


Leu Gin 


Thr 


Lys 


Val 


Thr 


225 






230 




235 








240 


Gly Ala Leu 


Gly 


Phe 


Asp 


Ser Gin Gly Asn 


Met Gin 


Leu 


Asn 


Val 


Ala 




245 




250 








255 




Gly Gly Leu Arg 


He 


Asp 


Ser Gin Asn Arg 


Arg Leu 


He 


Leu Asp 


Val 




260 






265 






270 






Ser Tyr Pro 


Phe 


Asp 


Ala 


Gin Asn Gin Leu 


Asn Leu 


Arg 


Leu Gly 


Gin 


275 








280 




285 








Gly Pro Leu 


Phe 


He 


Asn 


Ser Ala His Asn 


Leu Asp 


He 


Asn Tyr 


Asn 


290 








295 


300 










Lys Gly Leu 


Tyr 


Leu 


Phe 


Thr Ala Ser Asn 


Asn Ser 


Lys 


Lys 


Leu 


Glu 


305 






310 




315 








320 


Val Asn Leu 


Ser 


Thr 


Ala 


Lys Gly Leu Met 


Phe Asp Ala 


Thr 


Ala 


He 






325 




330 








335 




Ala lie Asn 


Ala 


Gly Asp Gly Leu Glu Phe 


Gly Ser 


Pro 


Asn 


Ala 


Pro 




340 






345 






350 






Asn Thr Asn 


Pro 


Leu 


Lys 


Thr Lys He Gly His Gly 


Leu 


Glu 


Phe 


Asp 


355 








360 




365 








Ser Asn Lys 


Ala 


Met 


Val 


Pro Lys Leu Gly Thr Gly 


Leu 


Ser 


Phe 


Asp 


370 








375 


380 










Ser Thr Gly Ala 


He 


Thr 


Val Gly Asn Lys Asn Asn Asp 


Lys 


Leu 


Thr 


385 






390 




395 








400 


Leu Trp Thr 


Thr 


Pro 


Ala 


Pro Ser Pro Asn 


Cys Arg 


Leu 


Asn 


Ala 


Glu 




405 




410 








415 




Lys Asp Ala 


Lys 


Leu 


Thr 


Leu Val Leu Thr 


Lys Cys 


Gly 


Ser 


Gin 


He 




420 






425 






430 






Leu Ala Thr 


Val 


Ser 


Val 


Leu Ala Val Lys 


Gly Ser 


Leu 


Ala 


Pro 


He 


435 








440 




445 








Ser Gly Thr Val 


Gin 


Ser 


Ala His Leu He 


He Arg 


Phe 


Asp 


Glu 


Asn 


450 








455 


460 










Gly Val Leu 


Leu 


Asn 


Asn 


Ser Phe Leu Asp 


Pro Glu 


Tyr 


Trp Asn 


Phe 


465 






470 




475 








480 


Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn 


Ala 


Val 


Gly 






485 




490 








495 




Phe Met Pro 


Asn 


Leu 


Ser 


Ala Tyr Pro Lys 


Ser His 


Gly 


Lys 


Thr 


Ala 




500 






505 






510 






Lys Ser Asn 


lie 


Val 


Ser 


Gin Val Tyr Leu Asn Gly Asp 


Lys 


Thr 


Lys 


515 








520 




525 








Pro Val Thr 


Leu 


Thr 


He 


Thr Leu Asn Gly 


Thr Gin 


Glu 


Thr Gly 


Asp 


530 








535 


54 0 










His Cys Asp 


Cys 


Arg 


Gly Asp Cys Phe Cys 


Thr Thr 


Pro 


Ser 


Ala 


Tyr 


545 






550 




555 








560 
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Ser Met Ser Phe Ser Trp Asp Trp Ser Gly His Asn Tyr He Asn .Glu 

565 570 t 

He Phe Ala Thr Ser Ser Tyr Thr Phe Ser Tyr lie Ala Gin Glu 
580 585 >w 



<210> 55 
<211> 1746 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5F KOIS* 
<400> 55 



a^aagcgcg caagaccgtc tgaagatacc ttcaaccccg tgtatccata tgacacggaa 60 
acgaaguyt-y a 4 _i-„ 1 - + . ar ,*- rrtcccttta tatcccccaa tgggtttcaa x^u 

III ill lis ss is fill I 

life ISiS iiE 132 SSI si I 

glllililillliillill 

ilifi &sk ssess is SSI I I S 

~i ilif EiS iili b|i ssii! is 

ins esis ins sii ssi lis e 

ttg^Saccl caclagctcl agaggctaac tgtagactaa atgcagagaa agatgctaaa 1260 

Ills ilii -ill S£ i |g §fs & 

iflll IS! Sill SIS! is ii 

lEHif SEES SS2S SIS 

gaataa 



<210> 56 
<211> 581 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F KOIS* 



^ttiyl 6 ^ Ala-Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 
Tyr Asp Thr Glu tL Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 

Phe Val Ser Pro Asn Gly Phe Gin Glu Ser Pro Pro Gly Val Leu Ser 

35 4 0 
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Leu Arg Leu Ser Glu Pro Leu Val Thr Ser Asn Gly Met Leu Ala Leu 

50 55 60 

Lys Met Gly Asn Gly Leu Ser Leu Asp Glu Ala Gly Asn Leu Thr Ser 
65 70 75 80 

Gin Asn Val Thr Thr Val Ser Pro Pro Leu Gly Ala Gly Ala Ser Asn 

85 90 95 

lie Asn Leu Glu lie Ser Ala Pro Leu Thr Val Thr Ser Glu Ala Leu 

100 105 110 

Thr Val Ala Ala Ala Ala Pro Leu Met Val Ala Gly Asn Thr Leu Thr 

115 120 125 

Met Gin Ser Gin Ala Pro Leu Thr Val His Asp Ser Lys Leu Ser lie 

130 135 140 

Ala Thr Gin Gly Pro Leu Thr Val Ser Glu Gly Lys Leu Ala Leu Gin 
145 150 155 160 

Thr Ser Gly Pro Leu Thr Thr Thr Asp Ser Ser Thr Leu Thr He Thr 

165 170 175 

Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly He Asp Leu 

180 185 190 

Lys Glu Pro lie Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 

195 200 205 

Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 

210 215 220 

Gly Pro Gly Val Thr He Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
225 230 235 240 

Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 

245 250 255 

Gly Gly Leu Arg He Asp Ser Gin Asn Arg Arg Leu He Leu Asp Val 

260 265 270 

Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 

275 280 285 

Gly Pro Leu Phe He Asn Ser Ala His Asn Leu Asp He Asn Tyr Asn 

290 295 300 

Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
305 310 315 320 

Val Asn Leu Ser Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala He 

325 330 335 

Ala He Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 

340 345 350 

Asn Thr Asn Pro Leu Lys Thr Lys He Gly His Gly Leu Glu Phe Asp 

355 360 365 

Ser Asn Lys Ala Met' Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 

370 375 380 

Ser Thr Gly Ala He Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 390 395 400 

Leu Trp Thr Thr Pro Ala Pro Glu Ala Asn Cys Arg Leu Asn Ala Glu 

405 410 415 

Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin He 

420 425 430 

Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro He 

435 440 445 

Ser Gly Thr Val Gin Ser Ala His Leu He He Arg Phe Asp Glu Asn 

450 455 460 

Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe 
465 470 475 480 

Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly 

485 490 495 

Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala 

500 505 510 

Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn Gly Asp Lys Thr Lys 

515 520 525 

Pro Val Thr Leu Thr He Thr Leu Asn Gly Thr Gin Glu Thr Gly Asp 
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530 535 
Thr Thr Pro Ser Ala Tyr Ser Met 
545 550 
His Asn Tyr lie Asn Glu lie Phe 
565 

Tyr lie Ala Gin Glu 
580 



540 

Ser Phe Ser Trp Asp Trp Ser Gly 
555 560 
Ala Thr Ser Ser Tyr Thr Phe Ser 
570 575 



<210> 57 
<211> 1776 
<212> DHA 

<213> Artificial Sequence 
<220> 

<223> 5F K01S*RGD 
<400> 57 

atgaagcgcg caagaccgtc tgaagatacc ttcaaccccg tgtatccata tgacacggaa 60 
accggtcctc caactgtgcc ttttcttact cctccctttg tatcccccaa tgggtttcaa 120 
gagagtcccc ctggggtact ctctttgcgc ctatccgaac ctctagttac ctccaatggc 180 
atgcttgcgc tcaaaatggg caacggcctc tctctggacg aggccggcaa ccttacctcc 240 
caaaatgtaa ccactgtgag cccacctctc ggagccggag cctcaaacat aaacctggaa 300 
atatctgcac ccctcacagt tacctcagaa gccctaactg tggctgccgc cgcacctcta 360 
atggtcgcgg gcaacacact caccatgcaa tcacaggccc cgctaaccgt gcacgactcc 420 
aaacttagca ttgccaccca aggacccctc acagtgtcag aaggaaagct agccctgcaa 480 
acatcaggcc ccctcaccac caccgatagc agtaccctta ctatcactgc ctcaccccct 540 
ctaactactg ccactggtag cttgggcatt gacttgaaag agcccattta tacacaaaat 600 
ggaaaactag gactaaagta cggggctcct ttgcatgtaa cagacgacct aaacactttg 660 
accgtagcaa ctggtccagg tgtgactatt aataatactt ccttgcaaac taaagttact 720 
ggagccttgg gttttgattc acaaggcaat atgcaactta atgtagcagg aggactaagg 780 
attgattctc aaaacagacg ccttatactt gatgttagtt atccgtttga tgctcaaaac 840 
caactaaatc taagactagg acagggccct ctttttataa actcagccca caacttggat 900 
attaactaca acaaaggcct ttacttgttt acagcttcaa acaattccaa aaagcttgag 960 
gttaacctaa gcactgccaa ggggttgatg tttgacgcta cagccatagc cattaatgca 1020 
ggagatgggc ttgaatttgg ttcacctaat gcaccaaaca caaatcccct caaaacaaaa 1080 
attggccatg gcctagaatt tgattcaaac aaggctatgg ttcctaaact aggaactggc 1140 
cttagttttg acagcacagg tgccattaca gtaggaaaca aaaataatga taagctaact 1200 
ttgtggacca caccagctcc agaggctaac tgtagactaa atgcagagaa agatgctaaa 1260 
ctcactttgg tcttaacaaa atgtggcagt caaatacttg ctacagtttc agttttggct 1320 
gttaaaggca gtttggctcc aatatctgga acagttcaaa gtgctcatct tattataaga 1380 
tttgacgaaa atggagtgct actaaacaat tccttcctgg acccagaata ttggaacttt 1440 
agaaatggag atcttactga aggcacagcc tatacaaacg ctgttggatt tatgcctaac 1500 
ctatcagctt atccaaaatc tcacggtaaa actgccaaaa gtaacattgt cagtcaagtt 1560 
tacttaaacg gagacaaaac taaacctgta acactaacca ttacactaaa cggtacacag 1620 
gaaacaggtg atcattgtga ttgtcgtggt gattgttttt gtacaactcc aagtgcatac 1680 
tctatgtcat tttcatggga ctggtctggc cacaactaca ttaatgaaat atttgccaca 1740 
tcctcttaca ctttttcata cattgcccaa gaataa 1776 

<210> 58 
<211> 591 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5F K01S*RGD 
<400> 58 

Met Lys Arg Ala Arg Pro Ser Glu Asp Thr Phe Asn Pro Val Tyr Pro 

1 5 10 15 

Tyr Asp Thr Glu Thr Gly Pro Pro Thr Val Pro Phe Leu Thr Pro Pro 
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20 25 30 

Phe Val Ser Pro Asn Gly Phe Gin Glu Ser Pro Pro Gly Val Leu Ser 

35 40 45 

Leu Arg Leu Ser Glu Pro Leu Val Thr Ser Asn Gly Met Leu Ala Leu 

50 55 60 

Lys Met Gly Asn Gly Leu Ser Leu Asp Glu Ala Gly Asn Leu Thr Ser 
65 70 75 80 

Gin Asn Val Thr Thr Val Ser Pro Pro Leu Gly Ala Gly Ala Ser Asn 

85 90 95 

lie Asn Leu Glu He Ser Ala Pro Leu Thr Val Thr Ser Glu Ala Leu 

100 105 110 

Thr Val Ala Ala Ala Ala Pro Leu Met Val Ala Gly Asn Thr Leu Thr 

115 120 125 

Met Gin Ser Gin Ala Pro Leu Thr Val His Asp Ser Lys Leu Ser He 

130 135 140 

Ala Thr Gin Gly Pro Leu Thr Val Ser Glu Gly Lys Leu Ala Leu Gin 
145 150 155 160 

Thr Ser Gly Pro Leu Thr Thr Thr Asp Ser Ser Thr Leu Thr He Thr 

165 170 175 

Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly He Asp Leu 

180 185 190 

Lys Glu Pro He Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 

195 200 205 

Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 

210 215 220 

Gly Pro Gly Val Thr He Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
225 230 235 240 

Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 

245 250 255 

Gly Gly Leu Arg He Asp Ser Gin Asn Arg Arg Leu He Leu Asp Val 

260 265 270 

Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 

275 280 285 

Gly Pro Leu Phe He Asn Ser Ala His Asn Leu Asp He Asn Tyr Asn 

290 295 300 

Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
305 310 315 320 

Val Asn Leu Ser Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala He 

325 330 335 

Ala He Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 

340 345 350 

Asn Thr Asn Pro Leu Lys Thr Lys He Gly His Gly Leu Glu Phe Asp 

355 360 365 

Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 

370 375 380 

Ser Thr Gly Ala He Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 390 395 400 

Leu Trp Thr Thr Pro Ala Pro Glu Ala Asn Cys Arg Leu Asn Ala Glu 

405 410 " 415 

Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys Cys Gly Ser Gin He 

420 425 430 

Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly Ser Leu Ala Pro He 

435 440 445 

Ser Gly Thr Val Gin Ser Ala His Leu He He Arg Phe Asp Glu Asn 

450 455 460 

Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro Glu Tyr Trp Asn Phe 
465 470 475 480 

Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr Thr Asn Ala Val Gly 

485 490 495 

Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser His Gly Lys Thr Ala 
500 505 510 
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Lys 


Ser 


Asn 


He 


Val 


Ser Gin 


Val Tyr Leu Asn 


Gly 


Asp 


Lys 


Thr 


Lys 




515 








520 




525 








Pro 


Val 


Thr 


Leu 


Thr 


He Thr 


Leu Asn Gly Thr 


Gin Glu Thr Gly 


Asp 


530 








535 




540 










His 


Cys 


Asp 


Cys 


Arg 


Gly Asp 


Cys Phe Cys Thr 


Thr 


Pro 


Ser 


Ala 


Tyr 


545 






550 


555 










560 


Ser 


Met 


Ser 


Phe 


Ser 


Trp Asp 


Trp Ser Gly His 


Asn 


Tyr 


He 


Asn 


Glu 










565 


570 








575 




He 


Phe 


Ala 


Thr 


Ser 


Ser Tyr 


Thr Phe Ser Tyr 


He 


Ala 


Gin 


Glu 










580 




585 






590 







<210> 59 
<211> 972 
<212> DNA 

<213> Artificial Seqfuence 

<220> 
<223> 35F 

<400> 59 

atgaccaaga gagtccggct cagtgactcc ttcaaccctg tctaccccta tgaagatgaa 60 

agcacctccc aacacccctt tataaaccca gggtttattt ccccaaatgg cttcacacaa 120 

agcccagacg gagttcttac tttaaaatgt ttaaccccac taacaaccac aggcggatct 180 

ctacagctaa aagtgggagg gggacttaca gtggatgaca ctgatggtac cttacaagaa 240 

aacatacgtg ctacagcacc cattactaaa aataatcact ctgtagaact atccattgga 300 

aatggattag aaactcaaaa caataaacta tgtgccaaat tgggaaatgg gttaaaattt 360 

aacaacggtg acatttgtat aaaggatagt attaacacct tatggactgg aataaaccct 420 

ccacctaact gtcaaattgt ggaaaacact aatacaaatg atggcaaact tactttagta 480 

ttagtaaaaa atggagggct tgttaatggc tacgtgtctc tagttggtgt atcagacact 540 

gtgaaccaaa tgttcacaca aaagacagca aacatccaat taagattata ttttgactct 600 

tctggaaatc tattaactga ggaatcagac ttaaaaattc cacttaaaaa taaatcttct 660 

acagcgacca gtgaaactgt agccagcagc aaagccttta tgccaagtac tacagcttat 720 

cccttcaaca ccactactag ggatagtgaa aactacattc atggaatatg ttactacatg 780 

actagttatg atagaagtct atttcccttg aacatttcta taatgctaaa cagccgtatg 840 

atttcttcca atgttgccta tgccatacaa tttgaatgga atctaaatgc aagtgaatct 900 

ccagaaagca acatagctac gctgaccaca tccccctttt tcttttctta cattacagaa 960 
gacgacgaat aa 972 

<210> 60 
<211> 323 
<212> PRT 

<213> Artificial Sequence 

<220> 
<223> 35F 



<400> 60 



Met Thr 


Lys 


Arg 


Val 


Arg 


Leu 


Ser 


Asp 


Ser 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 


1 


5 










10 










15 




Tyr Glu 


Asp 


Glu 


Ser 


Thr 


Ser 


Gin 


His 


Pro 


Phe 


He 


Asn 


Pro 


Gly 


Phe 


20 










25 










30 






He Ser 


Pro 


Asn 


Gly 


Phe 


Thr 


Gin 


Ser 


Pro 


Asp 


Gly 


Val 


Leu 


Thr 


Leu 




35 








40 










45 








Lys Cys 


Leu 


Thr 


Pro 


Leu 


Thr 


Thr 


Thr 


Gly 


Gly 


Ser 


Leu 


Gin 


Leu 


Lys 


50 










55 










60 










Val Gly 


Gly 


Gly 


Leu 


Thr 


Val 


Asp 


Asp 


Thr 


Asp 


Gly 


Thr 


Leu 


Gin 


Glu 


65 






70 










75 




His 






80 


Asn He 


Arg 


Ala 


Thr 


Ala 


Pro 


He 


Thr 


Lys 


Asn 


Asn 


Ser 


Val 


Glu 




85 










90 










95 




Leu Ser 


He 


Gly 


Asn 


Gly 


Leu 


Glu 


Thr 


Gin 


Asn 


Asn 


Lys 


Leu 


Cys 


Ala 
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100 










Lys 


Leu Gly 


Asn Gly 


Leu 


Lys 


Phe 




115 










120 


Asp 


Ser He 


Asn 


Thr 


Leu 


Trp 


Thr 


130 








135 




Gin 


He Val 


Glu 


Asn 


Thr 


Asn 


Thr 


145 








150 






Leu 


Val Lys 


Asn Gly 


Gly 


Leu 


Val 






165 








Val 


Ser Asp 


Thr 


Val 


Asn 


Gin 


Met 






180 










Gin 


Leu Arg 


Leu 


Tyr 


Phe 


Asp 


Ser 




195 










200 


Ser 


Asp Leu 


Lys 


He 


Pro 


Leu 


Lys 




210 








215 




Glu 


Thr Val 


Ala 


Ser 


Ser 


Lys 


Ala 


225 








230 






Pro 


Phe Asn 


Thr 


Thr 


Thr 


Arg Asp 








245 








Cys 


Tyr Tyr 


Met 


Thr 


Ser 


Tyr Asp 




260 










Ser 


He Met 


Leu 


Asn 


Ser 


Arg 


Met 




275 










280 


He 


Gin Phe 


Glu 


Trp 


Asn 


Leu 


Asn 




290 








295 




He 


Ala Thr 


Leu 


Thr 


Thr 


Ser 


Pro 


305 








310 






Asp 


Asp Glu 













105 










110 






Asn 


Asn Gly Asp 


He 


Cys 


He 


Lys 










125 








Gly 


He 


Asn 


Pro 


Pro 


Pro 


Asn 


Cys 






140 










Asn 


Asp 


Gly Lys 


Leu 


Thr 


Leu 


Val 






155 










160 


Asn 


Gly Tyr Val 


Ser Leu Val Gly 




170 










175 




Phe 


Thr 


Gin 


Lys 


Thr 


Ala 


Asn 


He 


185 










190 






Ser 


Gly Asn 


Leu 


Leu 


Thr 


Glu 


Glu 










205 








Asn 


Lys 


Ser 


Ser 


Thr 


Ala 


Thr 


Ser 






220 










Phe 


Met 


Pro 

235. 


Ser 


Thr 


Thr 


Ala 


Tyr 
240 


Ser 


Glu Asn 


Tyr 


He 


His 


Gly 


He 




250 










255 




Arg 


Ser 


Leu 


Phe 


Pro 


Leu 


Asn 


He 


265 










270 






He 


Ser 


Ser 


Asn 


Val 
285 


Ala 


Tyr 


Ala 


Ala 


Ser 


Glu 


Ser 
300 


Pro 


Glu 


Ser 


Asn 


Phe 


Phe 


Phe 
315 


Ser 


Tyr 


He 


Thr 


Glu 
320 



<210> 61 
<211> 1002 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 35F RGD 
<400> 61 

atgaccaaga gagtccggct cagtgactcc 
agcacctccc aacacccctt tataaaccca 
agcccagacg gagttcttac tttaaaatgt 
ctacagctaa aagtgggagg gggacttaca 
aacatacgtg ctacagcacc cattactaaa 
aatggattag aaactcaaaa caataaacta 
aacaacggtg acatttgtat aaaggatagt 
ccacctaact gtcaaattgt ggaaaacact 
ttagtaaaaa atggagggct tgttaatggc 
gtgaaccaaa tgttcacaca aaagacagca 
tctggaaatc tattaactga ggaatcagac 
acagcgacca gtgaaactgt agccagcagc 
cccttcaaca ccactactag ggatagtgaa 
actagttatg atagaagtct atttcccttg 
atttcttcca atgtacattg tgattgtcgt 
tttgaatgga atctaaatgc aagtgaatct 
tccccctttt tcttttctta cattacagaa 



ttcaaccctg tctaccccta tgaagatgaa 60 
gggtttattt ccccaaatgg cttcacacaa 120 
ttaaccccac taacaaccac aggcggatct 180 
gtggatgaca ctgatggtac cttacaagaa 240 
aataatcact ctgtagaact atccattgga 300 
tgtgccaaat tgggaaatgg gttaaaattt 360 
attaacacct tatggactgg aataaaccct 420 
aatacaaatg atggcaaact tactttagta 480 
tacgtgtctc tagttggtgt atcagacact 540 
aacatccaat taagattata ttttgactct 600 
ttaaaaattc cacttaaaaa taaatcttct 660 
aaagccttta tgccaagtac tacagcttat 720 
aactacattc atggaatatg ttactacatg 780 
aacatttcta taatgctaaa cagccgtatg 840 
ggtgattgtt tttgcgcata tgccatacaa 900 
ccagaaagca acatagctac gctgaccaca 960 
gacgacgaat aa 1002 



<210> 62 
<211> 333 
<212> PRT 
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<213> Artificial Sequence 
<220> 

<223> 35F RGD 



<400> 62 



Met 


Thr Lys 


Arg 


Val 


Arg 


Leu 


Ser 


Asp Ser 


Phe 


Asn 


Pro 


Val 


Tyr 


Pro 


1 






5 








10 










15 




Tyr Glu Asp 


Glu 


Ser 


Thr 


Ser 


Gin 


His Pro 


Phe 


He 


Asn 


Pro Gly 


Phe 






20 










25 








30 






lie 


Ser Pro 


Asn Gly Phe 


Thr 


Gin 


Ser Pro 


Asp 


Gly 


Val 


Leu 


Thr 


Leu 




35 










40 








45 








Lys 


Cys Leu 


Thr 


Pro 


Leu 


Thr 


Thr 


Thr Gly Gly 


Ser 


Leu 


Gin 


Leu 


Lys 


50 








55 








60 










Val 


Gly Gly 


Gly 


Leu 


Thr 


Val 


Asp 


Asp Thr Asp Gly 


Thr 


Leu 


Gin 


Glu 


65 








70 








75 










80 


Asn 


lie Arg 


Ala 


Thr 


Ala 


Pro 


He 


Thr Lys 


Asn 


Asn 


His 


Ser 


Val 


Glu 






85 








90 










95 




Leu 


Ser lie 


Gly Asn Gly Leu 


Glu 


Thr Gin 


Asn 


Asn 


Lys 


Leu 


Cys 


Ala 






100 










105 








110 






Lys 


Leu Gly 


Asn Gly Leu Lys 


Phe 


Asn Asn Gly Asp 


He 


Cys 


He 


Lys 


115 










120 








125 








Asp 


Ser He 


Asn 


Thr 


Leu 


Trp 


Thr 


Gly He 


Asn 


Pro 


Pro 


Pro 


Asn 


Cys 


130 








135 








140 










Gin 


He Val 


Glu 


Asn 


Thr 


Asn 


Thr 


Asn Asp Gly Lys 


Leu 


Thr 


Leu 


Val 


145 








150 








155 










160 


Leu 


Val Lys 


Asn Gly Gly Leu Val 


Asn Gly Tyr Val 


Ser 


Leu 


Val 


Gly 








165 








170 










175 




Val 


Ser Asp 


Thr 


Val 


Asn 


Gin 


Met 


Phe Thr 


Gin 


Lys 


Thr 


Ala 


Asn 


He 




180 










185 








190 






Gin Leu Arg 


Leu 


Tyr 


Phe 


Asp 


Ser 


Ser Gly Asn Leu 


Leu 


Thr 


Glu 


Glu 




195 










200 








205 








Ser Asp Leu 


Lys 


He 


Pro 


Leu 


Lys 


Asn Lys 


Ser 


Ser 


Thr 


Ala 


Thr 


Ser 




210 








215 








220 










Glu 


Thr Val 


Ala 


Ser 


Ser 


Lys 


Ala 


Phe Met 


Pro 


Ser 


Thr 


Thr 


Ala 


Tyr 


225 








230 








235 










240 


Pro 


Phe Asn 


Thr 


Thr Thr Arg Asp 


Ser Glu 


Asn 


Tyr 


He 


His 


Gly 


He 








245 








250 










255 




Cys 


Tyr Tyr 


Met 


Thr 


Ser 


Tyr Asp 


Arg Ser 


Leu 


Phe 


Pro 


Leu 


Asn 


He 






260 










265 








270 






Ser 


He Met 


Leu 


Asn 


Ser Arg 


Met 


He Ser 


Ser 


Asn 


Val 


His 


Cys 


Asp 




275 










280 








285 








Cys Arg Gly 


Asp 


Cys 


Phe 


Cys 


Ala 


Tyr Ala 


He 


Gin 


Phe 


Glu 


Trp 


Asn 




290 








295 








300 










Leu 


Asn Ala 


Ser 


Glu 


Ser 


Pro 


Glu 


Ser Asn 


He 


Ala 


Thr 


Leu 


Thr 


Thr 


305 








310 








315 










320 


Ser 


Pro Phe 


Phe 


Phe 


Ser 


Tyr 


He 


Thr Glu Asp 


Asp 


Glu 









325 330 



<210> 63 
<211> 1164 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 41sF 
<400> 63 

atgaaaagaa ccagaattga agacgacttc aaccccgtct acccctatga caccttctca 60 
actcccagca tcccctatgt agctccgccc ttcgtttctt ctgacgggtt acaggaaaaa 120 
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cccccaggag 
cttactttaa 
cccactgttt 
cccatagctg 
aacaaccagc 
gttaatccta 
ggaggaatga 
atcaaccagc 
aacgttaagt 
agcaaccgaa 
aactgctcca 
gctcacgtat 
gataacgctc 
ttggaatcat 
tttatgccca 
atccaaatct 
gcttttactt 
ttttgctaca 



ttttagcact 
aacttggaag 
ctcctcccct 
tatcagctaa 
ttagtattaa 
ccggagcgct 
gagtggacgg 
taacactgcg 
tgggatcagg 
ctcgaagtgt 
tttatgaaac 
taggtactat 
tatctttaaa 
ccacctggcg 
acagtacagt 
cgcctaacat 
ttaaatggtc 
taactgaaga 



caagtacact 
caacataact 
tacaaacagt 
ctctctcaca 
cgcgggcaga 
aggtttcaat 
tgccaactta 
attagaaaac 
cctccaattt 
accatccctc 
ccaagatgca 
aacaatcaaa 
acttcccttt 
ttaccaggaa 
gtatccacga 
caccttcagt 
agccgaaccg 
ataa 



gaccccatta 
ttagaaaatg 
aacaactccc 
ttggccaccg 
ggtttagtta 
aacacaggag 
attcttcatg 
gggttagaag 
gacagtaacg 
actaccattt 
aacctatttc 
ggtcttaaag 
gacaatcagg 
accaacgcag 
aacaaaaccg 
gtcgtctaca 
ggaaaacctt 



ctaccaatgc 
ggttactttc 
tgggtttagc 
ccgcaccact 
taactaacaa 
ctttacaatt 
tagcatatcc 
taaccagcgg 
gacgcattgc 
ggtctatctc 
tttgtctaac 
gagcactgcg 
gaaatttact 
tggcctctaa 
ctcacccggg 
acgagataaa 
ttcacccacc 



taagcatgag 
ggccacagtt 
cacatccgct 
gacagtaagc 
tgccttaaca 
aaatgctgca 
ctttgaagca 
aggaaagctt 
tattagtaat 
gcctacgcct 
taaaaacgga 
ggaaatgcac 
taactgtgcc 
tgccttaaca 
caacatgctc 
cagtgggtat 
taccgctgta 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1164 



<210> 64 
<211> 387 
<212> PRT 

<213> Artificial Sequence 

<220> 
<223> 41sF 



<400> 64 

Met Lys Arg Thr Arg lie Glu Asp 

1 5 
Asp Thr Phe Ser Thr Pro Ser lie 
20 

Ser Ser Asp Gly Leu Gin Glu Lys 

35 40 
Tyr Thr Asp Pro lie Thr Thr Asn 

50 55 
Leu Gly Ser Asn lie Thr Leu Glu 
65 70 
Pro Thr Val Ser Pro Pro Leu Thr 
85 

Ala Thr Ser Ala Pro lie Ala Val 
100 

Thr Ala Ala Pro Leu Thr Val Ser 
115 120 
Gly Arg Gly Leu Val lie Thr Asn 

130 135 
Gly Ala Leu Gly Phe Asn Asn Thr 
145 150 
Gly Gly Met Arg Val Asp Gly Ala 
165 

Pro Phe Glu Ala lie Asn Gin Leu 
180 

Glu Val Thr Ser Gly Gly Lys Leu 
195 200 
Gin Phe Asp Ser Asn Gly Arg lie 

210 215 
Arg Ser Val Pro Ser Leu Thr Thr 
225 230 
Asn Cys Ser lie Tyr Glu Thr Gin 
245 

Thr Lys Asn Gly Ala His Val Leu 



Asp Phe Asn Pro Val Tyr 
10 

Pro Tyr Val Ala 
25 

Pro Pro Gly Val 



Ala Lys His Glu 
60 

Asn Gly Leu Leu 
75 

Asn Ser Asn Asn 
90 

Ser Ala Asn Ser 
105 

Asn Asn Gin Leu 

Asn Ala Leu Thr 
140 

Gly Ala Leu Gin 
155 

Asn Leu lie Leu 
170 

Thr Leu Arg Leu 
185 

Asn Val Lys Leu 

Ala lie Ser Asn 
220 

lie Trp Ser lie 
235 

Asp Ala Asn Leu 
250 

Gly Thr lie Thr 



Pro Pro 

30 
Leu Ala 
45 

Leu Thr 



Ser Ala 

Ser Leu 

Leu Thr 
110 
Ser lie 
125 

Val Asn 

Leu Asn 

His Val 

Glu Asn 
190 
Gly Ser 
205 

Ser Asn 
Ser Pro 
Phe Leu 
lie Lys 



Pro Tyr 
15 

Phe Val 

Leu Lys 

Leu Lys 

Thr Val 

80 
Gly Leu 
95 

Leu Ala 

Asn Ala 

Pro Thr 

Ala Ala 
160 
Ala Tyr 
175 

Gly Leu 

Gly Leu 

Arg Thr 

Thr Pro 
240 
Cys Leu 
255 

Gly Leu 
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260 


Lys 


Gly Ala 


Leu 






275 




Pro 


Phe 


Asp 


Asn 




290 






Thr 


Trp 


Arg 


Tyr 


305 








Phe 


Met 


Pro 


Asn 


Gly 


Asn 


Met 


Leu 








340 


Tyr 


Asn 


Glu 


lie 






355 




Glu 


Pro Gly 


Lys 




370 






Thr 


Glu 


Glu 




385 









Arg 


Glu 


Met 


His 








280 


Gin 


Gly Asn 


Leu 






295 




Gin 


Glu 


Thr 


Asn 




310 






Ser 


Thr 


Val 


Tyr 


325 








He 


Gin 


He 


Ser 


Asn 


Ser 


Gly 


Tyr 








360 


Pro 


Phe 


His 


Pro 






375 





265 

Asp Asn Ala Leu 

Leu Asn Cys Ala 
300 

Ala Val Ala Ser 
315 

Pro Arg Asn Lys 
330 

Pro Asn He Thr 
345 

Ala Phe Thr Phe 

Pro Thr Ala Val 
380 





270 






Ser 


Leu 


Lys 


Leu 


285 








Leu 


Glu 


Ser 


Ser 


Asn 


Ala 


Leu 


Thr 








320 


Thr 


Ala 


His 


Pro 






335 




Phe 


Ser 


Val 


Val 




350 






Lys 


Trp 


Ser 


Ala 


365 








Phe 


Cys 


Tyr 


He 



<210> 65 
<211> 1194 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> 41sF RGD 
<400> 65 

atgaaaagaa ccagaattga agacgacttc aaccccgtct acccctatga caccttctca 60 
actcccagca tcccctatgt agctccgccc ttcgtttctt ctgacgggtt acaggaaaaa 12 0 
cccccaggag ttttagcact caagtacact gaccccatta ctaccaatgc taagcatgag 180 
cttactttaa aacttggaag caacataact ttagaaaatg ggtfcactttc ggccacagtt 240 
cccactgttt ctcctcccct tacaaacagt aacaactccc tgggtttagc cacatccgct 300 
cccatagctg tatcagctaa ctctctcaca ttggccaccg ccgcaccact gacagtaagc 360 
aacaaccagc ttagtattaa cgcgggcaga ggtttagtta taactaacaa tgccttaaca 42 0 
gttaatccta ccggagcgct aggtttcaat aacacaggag ctttacaatt aaatgctgca 480 
ggaggaatga gagtggacgg tgccaactta attcttcatg tagcatatcc ctttgaagca 540 
atcaaccagc taacactgcg attagaaaac gggttagaag taaccagcgg aggaaagctt 600 
aacgttaagt tgggatcagg cctccaattt gacagtaacg gacgcattgc tattagtaat 660 
agcaaccgaa ctcgaagtgt accatccctc actaccattt ggtctatctc gcctacgcct 720 
aactgctcca tttatgaaac ccaagatgca aacctatttc tttgtctaac taaaaacgga 780 
gctcacgtat taggtactat aacaatcaaa ggtcttaaag gagcactgcg ggaaatgcac 840 
gataacgctc tatctttaaa acttcccttt gacaatcagg gaaatttact taactgtgcc 900 
ttggaatcat ccacctggcg ttaccaggaa accaacgcag tggcctctaa tgccttaaca 960 
tttatgccca acagtacagt gtatccacga aacaaaaccg ctcacccggg caacatgctc 1020 
atccaaatct cgcctaacat caccttcagt gtcgtctaca acgagataaa ctgtgattgt 1080 
cgtggtgatt gtttttgtac tagtgggtat gcttttactt ttaaatggtc agccgaaccg 1140 
ggaaaacctt ttcacccacc taccgctgta ttttgctaca taactgaaga ataa 1194 

<210> 66 
<211> 397 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 41sF RGD 
<400> 66 

Met Lys Arg Thr Arg He Glu Asp Asp Phe Asn Pro Val Tyr Pro Tyr 

15 10 15 

Asp Thr Phe Ser Thr Pro Ser He Pro Tyr Val Ala Pro Pro Phe Val 
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20 25 30 

Ser Ser Asp Gly Leu Gin Glu Lys Pro Pro Gly Val Leu Ala Leu Lys 

35 40 45 

Tyr Thr Asp Pro lie Thr Thr Asn Ala Lys His Glu Leu Thr Leu Lys 

50 55 60 

Leu Gly Ser Asn lie Thr Leu Glu Asn Gly Leu Leu Ser Ala Thr Val 
65 70 75 80 

Pro Thr Val Ser Pro Pro Leu Thr Asn Ser Asn Asn Ser Leu Gly Leu 

85 90 95 

Ala Thr Ser Ala Pro lie Ala Val Ser Ala Asn Ser Leu Thr Leu Ala 

100 105 110 

Thr Ala Ala Pro Leu Thr Val Ser Asn Asn Gin Leu Ser He Asn Ala 

115 120 125 

Gly Arg Gly Leu Val He Thr Asn Asn Ala Leu Thr Val Asn Pro Thr 

130 135 140 

Gly Ala Leu Gly- Phe Asn Asn Thr Gly Ala Leu Gin Leu Asn Ala Ala 
145 150 155 160 

Gly Gly Met Arg Val Asp, Gly Ala Asn Leu He Leu His Val Ala Tyr 

165 170 175 

Pro Phe Glu Ala He Asn Gin Leu Thr Leu Arg Leu Glu Asn Gly Leu 

180 185 190 

Glu Val Thr Ser Gly Gly Lys Leu Asn Val Lys Leu Gly Ser Gly Leu 

195 200 205 

Gin Phe Asp Ser Asn Gly Arg He Ala He Ser Asn Ser Asn Arg Thr 

210 215 220 

Arg Ser Val Pro Ser Leu Thr Thr He Trp Ser He Ser Pro Thr Pro 
225 230 235 240 

Asn Cys Ser He Tyr Glu Thr Gin Asp Ala Asn Leu Phe Leu Cys Leu 

245 250 255 

Thr Lys Asn Gly Ala His Val Leu Gly Thr He Thr He Lys Gly Leu 

260 265 270 

Lys Gly Ala Leu Arg Glu Met 'His Asp Asn Ala Leu Ser Leu Lys Leu 

275 280 285 

Pro Phe Asp Asn Gin Gly Asn Leu Leu Asn Cys Ala Leu Glu Ser Ser 

290 295 300 

Thr Trp Arg Tyr Gin Glu Thr Asn Ala Val Ala Ser Asn Ala Leu Thr 
305 310 315 320 

Phe Met Pro Asn Ser Thr Val Tyr Pro Arg Asn Lys Thr Ala His Pro 

325 330 335 

Gly Asn Met Leu He Gin He Ser Pro Asn He Thr Phe Ser Val Val 

340 345 350 

Tyr Asn Glu He Asn Cys Asp Cys Arg Gly Asp Cys Phe Cys Thr Ser 

355 360 365 

Gly Tyr Ala Phe Thr Phe Lys Trp Ser Ala Glu Pro Gly Lys Pro Phe 

370 375 380 

His Pro Pro Thr Ala Val Phe Cys Tyr He Thr Glu Glu 
385 390 395 



<210> 67 
<211> 1737 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ad5 PD1 pent on 
<400> 67 

atgcggcgcg cggcgatgta tgaggaaggt cctcctccct cctacgagag tgtggtgagc 60 
gcggcgccag tggcggcggc gctgggttct cccttcgatg ctcccctgga cccgccgttt 120 
gtgcctccgc ggtacctgcg gcctaccggg gggagaaaca gcatccgtta ctctgagttg 180 
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gcacccctat tcgacaccac ccgtgtgtac ctggtggaca acaagtcaac ggatgtggca 240 
tccctgaact accagaacga ccacagcaac tttctgacca cggtcattca aaacaatgac 300 
tacagcccgg gggaggcaag cacacagacc atcaatcttg acgaccggtc gcactggggc 360 
ggcgacctga aaaccatcct gcataccaac atgccaaatg tgaacgagtt catgtttacc 420 
aataagttta aggcgcgggt gatggtgtcg cgcttgccta ctaaggacaa tcaggtggag 480 
ctgaaatacg agtgggtgga gttcacgctg cccgagggca actactccga gaccatgacc 540 
atagacctta tgaacaacgc gatcgtggag cactacttga aagtgggcag acagaacggg 600 
gttctggaaa gcgacatcgg ggtaaagttt gacacccgca acttcagact ggggtttgac 660 
cccgtcactg gtcttgtcat gcctggggta tatacaaacg aagccttcca tccagacatc 720 
attttgctgc caggatgcgg ggtggacttc acccacagcc gcctgagcaa cttgttgggc 780 
atccgcaagc ggcaaccctt ccaggagggc tttaggatca cctacgatga tctggagggt 840 
ggtaacattc ccgcactgtt ggatgtggac gcctaccagg cgagcttgaa agatgacacc 900 
gaacagggcg ggggtggcgc aggcggcagc aacagcagtg gcagcggcgc ggaagagaac 960 
tccaacgcgg cagccgcggc aatgcagccg gtggaggaca tgaacgatag ccgcggctac 1020 
ccctacgacg tgcccgacta cgcgggcacc agcgccacac gggctgagga gaagcgcgct 1080 
gaggccgaag cagcggccga agctgccgcc cccgctgcgc aacccgaggt cgagaagcct 1140 
cagaagaaac cggtgatcaa acccctgaca gaggacagca agaaacgcag ttacaaccta 1200 
ataagcaatg acagcacctt cacccagtac cgcagctggt accttgcata caactacggc 1260 
gaccctcaga ccggaatccg ctcatggacc ctgctttgca ctcctgacgt aacctgcggc 1320 
tcggagcagg tctactggtc gttgccagac atgatgcaag accccgtgac cttccgctcc 1380 
acgcgccaga tcagcaactt tccggtggtg ggcgccgagc tgttgcccgt gcactccaag 1440 
agcttctaca acgaccaggc cgtctactcit: caactcatcc gccagtttac ctctctgacc 1500 
cacgtgttca atcgctttcc cgagaaccar; attttggcgc gcccgccagc ccccaccatc 1560 
accaccgtca gtgaaaacgt tcctgctct;: acagatcacg ggacgctacc gctgcgcaac 1620 
agcatcggag gagtccagcg agtgaccatt actgacgcca gacgccgcac ctgcccctac 1680 
gtttacaagg ccctgggcat agtctcgccg cgcgtcctat cgagccgcac tttttga 1737 

<210> 68 
<211> 578 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Ad5 PD1 penton 
<400> 68 



Met Arg Arg Ala Ala Met 


Tyr 


Glu 


Glu Gly Pro 


Pro 


Pro 


Ser 


Tyr 


Glu 


1 5 






10 










15 




Ser Val Val Ser Ala Ala 


Pro 


Val 


Ala Ala 


Ala 


Leu 


Gly 


Ser 


Pro 


Phe 


20 






25 








30 






Asp Ala Pro Leu Asp Pro 


Pro 


Phe 


Val Pro 


Pro 


Arg 


Tyr 


Leu 


Arg 


Pro 


35 




40 








45 








Thr Gly Gly Arg Asn Ser 


lie 


Arg 


Tyr Ser 


Glu 


Leu 


Ala 


Pro 


Leu 


Phe 


50 


55 








60 










Asp Thr Thr Arg Val Tyr 


Leu 


Val 


Asp Asn 


Lys 


Ser 


Thr 


Asp 


Val 


Ala 


65 70 








75 










80 


Ser Leu Asn Tyr Gin Asn Asp His 


Ser Asn 


Phe 


Leu 


Thr 


Thr 


Val 


He 


85 






90 










95 




Gin Asn Asn Asp Tyr Ser 


Pro 


Gly Glu Ala 


Ser 


Thr 


Gin 


Thr 


He 


Asn 


100 






105 








110 






Leu Asp Asp Arg Ser His 


Trp Gly Gly Asp Leu Lys 


Thr 


He 


Leu 


His 


115 




12 0 








125 








Thr Asn Met Pro Asn Val 


Asn 


Glu 


Phe Met 


Phe 


Thr 


Asn 


Lys 


Phe 


Lys 


130 


135 








140 








Ala Arg Val Met Val Ser 


Arg 


Leu 


Pro Thr 


Lys 


Asp 


Asn 


Gin 


Val 


Glu 


145 150 








155 










160 


Leu Lys Tyr Glu Trp. Val 


Glu 


Phe 


Thr Leu 


Pro 


Glu 


Gly 


Asn 


Tyr 


Ser 


165 






170 










175 




Glu Thr Met Thr lie Asp 


Leu 


Met 


Asn Asn 


Ala 


lie 


Val 


Glu 


His 


Tyr 


180 






185 








190 






Leu Lys Val Gly Arg Gin Asn Gly Val Leu Glu Ser 


Asp 


He 


Gly 


Val 
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195 200 205 

Lys Phe Asp Thr Arg Asn Phe Arg Leu Gly Phe Asp Pro Val Thr Gly 

210 215 220 

Leu Val Met Pro Gly Val Tyr Thr Asn Glu Ala Phe His Pro Asp He 
225 230 235 240 

He Leu Leu Pro Gly Cys Gly Val Asp Phe Thr His Ser Arg Leu Ser 

245 250 255 

Asn Leu Leu Gly He Arg Lys Arg Gin Pro Phe Gin Glu Gly Phe Arg 

260 265 270 

He Thr Tyr Asp Asp Leu Glu Gly Gly Asn He Pro Ala Leu Leu Asp 

275 280 285 

Val Asp Ala Tyr Gin Ala Ser Leu Lys Asp Asp Thr Glu Gin Gly Gly 

290 295 300 

Gly Gly Ala Gly Gly Ser Asn Ser Ser Gly Ser Gly Ala Glu Glu Asn 
305 310 315 320 

Ser Asn Ala Ala Ala Ala Ala Met Gin Pro Val Glu Asp Met Asn Asp 

325 330 335 

Ser Arg Gly Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly Thr Ser Ala 

340 345 350 

Thr Arg Ala Glu Glu Lys Arg Ala Glu Ala Glu Ala Ala Ala Glu Ala 

355 360 365 

Ala Ala Pro Ala Ala Gin Pro Glu Val Glu Lys Pro Gin Lys Lys Pro 

370 375 380 

Val He Lys Pro Leu Thr Glu Asp Ser Lys Lys Arg Ser Tyr Asn Leu 
385 390 395 400 

He Ser Asn Asp Ser Thr Phe Thr- Gin Tyr Arg Ser Trp Tyr Leu Ala 

405 410 415 

Tyr Asn Tyr Gly Asp Pro Gin Thr Gly He Arg Ser Trp Thr Leu Leu 

420 425 430 

Cys Thr Pro Asp Val Thr Cys Gly Ser Glu Gin Val Tyr Trp Ser Leu 

435 440 445 

Pro Asp Met Met Gin Asp Pro Val Thr Phe Arg Ser Thr Arg Gin He 

450 455 460 

Ser Asn Phe Pro Val Val Gly Ala Glu Leu Leu Pro Val His Ser Lys 
465 470 475 480 

Ser Phe Tyr Asn Asp Gin Ala Val Tyr Ser Gin Leu He Arg Gin Phe 

485 490 495 

Thr Ser Leu Thr His Val Phe Asn Arg Phe Pro Glu Asn Gin He Leu 

500 505 510 

Ala Arg Pro Pro Ala Pro Thr He Thr Thr Val Ser Glu Asn Val Pro 

515 520 525 

Ala Leu Thr Asp His Gly Thr Leu Pro Leu Arg Asn Ser He Gly Gly 

530 535 540 

Val Gin Arg Val Thr He Thr Asp Ala Arg Arg Arg Thr Cys Pro Tyr 
545 550 555 560 

Val Tyr Lys Ala Leu Gly He Val Ser Pro Arg Val Leu Ser Ser Arg 
565 570 575 

Thr Phe 



<210> 69 
<211> 1773 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 5TS35H 
<400> 69 

atgaagcgcg caagaccgtc tgaagatacc , ttcaaccccg tgtatccata tgacacggaa 60 
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accggtcctc caactgtgcc ttttcttact cctccctttg tatcccccaa tgggtttcaa 120 
gagagtcccc ctggggtact ctctttgcgc ctatccgaac ctctagttac ctccaatggc 180 
atgcttgcgc tcaaaatggg caacggcctc tctctggacg aggccggcaa ccttacctcc 240 
caaaatgtaa ccactgtgag cccacctctc aaaaaaacca agtcaaacat aaacctggaa 300 
atatctgcac ccctcacagt tacctcagaa gccctaactg tggctgccgc cgcacctcta 360 
atggtcgcgg gcaacacact caccatgcaa tcacaggccc cgctaaccgt gcacgactcc 420 
aaacttagca ttgccaccca aggacccctc acagtgtcag aaggaaagct agccctgcaa 4 80 
acatcaggcc ccctcaccac caccgatagc agtaccctta ctatcactgc ctcaccccct 540 
ctaactactg ccactggtag cttgggcatt gacttgaaag agcccattta tacacaaaat 600 
ggaaaactag gactaaagta cggggctcct ttgcatgtaa cagacgacct aaacactttg 660 
accgtagcaa ctggtccagg tgtgactatt aataatactt ccttgcaaac taaagttact 720 
ggagccttgg gttttgattc acaaggcaat atgcaactta atgtagcagg aggactaagg 780 
attgattctc aaaacagacg ccttatactt gatgttagtt atccgtttga tgctcaaaac 840 
caactaaatc taagactagg acagggccct ctttttataa actcagccca caacttggat 900 
attaactaca acaaaggcct ttacttgttt acagcttcaa acaattccaa aaagcttgag 960 
gttaacctaa gcactgccaa ggggttgatg tttgacgcta cagccatagc cattaatgca 1020 
ggagatgggc ttgaatttgg ttcacctaat gcaccaaaca caaatcccct caaaacaaaa 1080 
attggccatg gcctagaatt tgattcaaac aaggctatgg ttcctaaact aggaactggc 1140 
cttagttttg acagcacagg tgccattaca gtaggaaaca aaaataatga taagctaact 1200 
ttgtggaccg gaataaaccc tccacctaac tgtcaaattg tggaaaacac taatacaaat 1260 
gatggcaaac ttactttagt attagtaaaa aatggagggc ttgttaatgg ctacgtgtct 1320 
ctagttggtg tatcagacac tgtgaaccaa atgttcacac aaaagacagc aaacatccaa 1380 
ttaagattat attttgactc ttctggaaat ctattaactg aggaatcaga cttaaaaatt 1440 
ccacttaaaa ataaatcttc tacagcgacc agtgaaactg tagccagcag caaagccttt 1500 
atgccaagta ctacagctta tcccttcaac accactacta gggatagtga aaactacatt 1560- 
catggaatat gttactacat gactagttat gatagaagtc tatttccctt gaacatttct 1620 
ataatgctaa acagccgtat gatttcttcc aatgttgcct atgccataca atttgaatgg 1680. 
aatctaaatg caagtgaatc tccagaaagc aacatagcta cgctgaccac atcccccttt 1740. 
ttcttttctt acattacaga agacgacgaa taa 1773 

<210> 70 
<211> 590 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 5TS35H 



<400> 70 



Met Lys 


Arg 


Ala 


Arg 


Pro 


Ser 


Glu 


Asp 


Thr Phe Asn Pro 


Val 


Tyr 


Pro 


1 




5 










10 




15 




Tyr Asp 


Thr 


Glu 


Thr 


Gly 


Pro 


Pro 


Thr 


Val Pro Phe Leu 


Thr 


Pro 


Pro 




20 










25 




30 






Phe Val 


Ser 


Pro 


Asn 


Gly 


Phe 


Gin 


Glu 


Ser Pro Pro Gly 


Val 


Leu 


Ser 




35 










40 




45 








Leu Arg 


Leu 


Ser 


Glu 


Pro 


Leu 


Val 


Thr 


Ser Asn Gly Met 


Leu 


Ala 


Leu 


50 










55 






60 








Lys Met 


Gly 


Asn 


Gly 


Leu 


Ser 


Leu 


Asp 


Glu Ala Gly Asn 


Leu 


Thr 


Ser 


65 








70 








75 






80 


Gin Asn 


Val 


Thr 


Thr 


Val 


Ser 


Pro 


Pro 


Leu Lys Lys Thr 


Lys 


Ser 


Asn 








85 










90 




95 




lie Asn 


Leu 


Glu 


lie 


Ser 


Ala 


Pro 


Leu 


Thr Val Thr Ser 


Glu 


Ala 


Leu 






100 










105 




110 






Thr Val 


Ala 


Ala 


Ala 


Ala 


Pro 


Leu 


Met 


Val Ala Gly Asn Thr 


Leu 


Thr 




115 










120 




125 








Met Gin 


Ser 


Gin 


Ala 


Pro 


Leu 


Thr 


Val 


His Asp Ser Lys 


Leu 


Ser 


He 


13 0 










135 






140 








Ala Thr 


Gin 


Gly 


Pro 


Leu 


Thr 


Val 


Ser 


Glu Gly Lys Leu 


Ala 


Leu 


Gin 


145 






150 








155 






160 


Thr Ser 


Gly 


Pro 


Leu 


Thr 


Thr 


Thr 


Asp 


Ser Ser Thr Leu 


Thr 


He 


Thr 






165 










170 




175 
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Ala Ser Pro Pro Leu Thr Thr Ala Thr Gly Ser Leu Gly lie Asp Leu 

180 185 190 

Lys Glu Pro lie Tyr Thr Gin Asn Gly Lys Leu Gly Leu Lys Tyr Gly 

195 200 205 

Ala Pro Leu His Val Thr Asp Asp Leu Asn Thr Leu Thr Val Ala Thr 

210 215 220 

Gly Pro Gly Val Thr lie Asn Asn Thr Ser Leu Gin Thr Lys Val Thr 
225 230 235 240 

Gly Ala Leu Gly Phe Asp Ser Gin Gly Asn Met Gin Leu Asn Val Ala 

245 250 1 255 

Gly Gly Leu Arg lie Asp Ser Gin Asn Arg Arg Leu lie Leu Asp Val 

260 265 270 

Ser Tyr Pro Phe Asp Ala Gin Asn Gin Leu Asn Leu Arg Leu Gly Gin 

275 280 285 

Gly Pro Leu Phe lie Asn Ser Ala His Asn Leu Asp lie Asn Tyr Asn 

290 295 300 

Lys Gly Leu Tyr Leu Phe Thr Ala Ser Asn Asn Ser Lys Lys Leu Glu 
305 310 315 320 

Val Asn Leu Ser Thr Ala Lys Gly Leu Met Phe Asp Ala Thr Ala lie 

325 330 335 

Ala lie Asn Ala Gly Asp Gly Leu Glu Phe Gly Ser Pro Asn Ala Pro 

340 345 350 

Asn Thr Asn Pro Leu Lys Thr Lys lie Gly His Gly Leu Glu Phe Asp 

355 360 365 

Ser Asn Lys Ala Met Val Pro Lys Leu Gly Thr Gly Leu Ser Phe Asp 

370 375 380 

Ser Thr Gly Ala He Thr Val Gly Asn Lys Asn Asn Asp Lys Leu Thr 
385 390 395 400 

Leu Trp Thr Gly He Asn Pro Pro Pro Asn Cys Gin He Val Glu Asn 

405 410 415 

Thr Asn Thr Asn Asp Gly Lys Leu Thr Leu Val Leu Val Lys Asn Gly 

420 425 430 

Gly Leu Val Asn Gly Tyr Val Ser Leu Val Gly Val Ser Asp Thr Val 

435 440 445 

Asn Gin Met Phe Thr Gin Lys Thr Ala Asn lie Gin Leu Arg Leu Tyr 

450 455 460 

Phe Asp Ser Ser Gly Asn Leu Leu Thr Glu Glu Ser Asp Leu Lys He 
465 470 475 480 

Pro Leu Lys Asn Lys Ser Ser Thr Ala Thr Ser Glu Thr Val Ala Ser 

485 490 495 

Ser Lys Ala Phe Met Pro Ser Thr Thr Ala Tyr Pro Phe Asn Thr Thr 

500 505 510 

Thr Arg Asp Ser Glu Asn Tyr He His Gly He Cys Tyr Tyr Met Thr 

515 520 525 

Ser Tyr Asp Arg Ser Leu Phe Pro Leu Asn He Ser He Met Leu Asn 

530 535 540 

Ser Arg Met He Ser Ser Asn Val Ala Tyr Ala He Gin Phe Glu Trp 
545 * 550 555 560 

Asn Leu Asn Ala Ser Glu Ser Pro Glu Ser Asn He Ala Thr Leu Thr 

565 570 575 

Thr Ser Pro Phe Phe Phe Ser Tyr He Thr Glu Asp Asp Glu 
580 585 590 



<210> 71 
<211> 945 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> 35TS5H 
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<400> 71 

atgaccaaga gagtccggct cagtgactcc ttcaaccctg tctaccccta tgaagatgaa 60 
agcacctccc aacacccctt tataaaccca gggtttattt ccccaaatgg cttcacacaa 120 
agcccagacg gagttcttac tttaaaatgt ttaaccccac taacaaccac aggcggatct 180 
ctacagctaa aagtgggagg gggacttaca gtggatgaca ctgatggtac cttacaagaa 240 
aacatacgtg ctacagcacc cattactaaa aataatcact ctgtagaact atccattgga 300 
aatggattag aaactcaaaa caataaacta tgtgccaaat tgggaaatgg gttaaaattt 360 
aacaacggtg acatttgtat aaaggatagt attaacacct tatggactac accagctcca 420 
tctcctaact gtagactaaa tgcagagaaa gatgctaaac tcactttggt cttaacaaaa 480 
tgtggcagtc aaatacttgc tacagtttca gttttggctg ttaaaggcag tttggctcca 540 
atatctggaa cagttcaaag tgctcatctt attataagat ttgacgaaaa tggagtgcta 600 
ctaaacaatt ccttcctgga cccagaatat tggaacttta gaaatggaga tcttactgaa 660 
ggcacagcct atacaaacgc tgttggattt atgcctaacc tatcagctta tccaaaatct 720 
cacggtaaaa ctgccaaaag taacattgtc agtcaagttt acttaaacgg agacaaaact 780 
aaacctgtaa cactaaccat tacactaaac ggtacacagg aaacaggaga cacaactcca 840 
agtgcatact ctatgtcatt ttcatgggac tggtctggcc acaactacat taatgaaata 900 
tttgccacat cctcttacac tttttcatac attgcccaag aataa 945 

<210> 72 
<211> 314 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 35TS5H 
<400> 72 

Met Thr Lys Arg Val Arg Leu Ser Asp Ser Phe Asn Pro Val Tyr Pro 

1 5 10 15 

Tyr Glu Asp Glu Ser Thr Ser Gin His Pro Phe lie Asn Pro Gly Phe 

20 25 30 

He Ser Pro Asn Gly Phe Thr Gin Ser Pro Asp Gly Val lieu Thr Leu 

35 40 45 

Lys Cys Leu Thr Pro Leu Thr Thr Thr Gly Gly Ser Leu Gin Leu Lys 

50 55 60 

Val Gly Gly Gly Leu Thr Val Asp Asp Thr Asp Gly Thr Leu Gin Glu 
65 70 75 80 

Asn He Arg Ala Thr Ala Pro He Thr Lys Asn Asn His Ser Val Glu 

85 90 95 

Leu Ser He Gly Asn Gly Leu Glu Thr Gin Asn Asn Lys Leu Cys Ala 

100 105 110 

Lys Leu Gly Asn Gly Leu Lys Phe Asn Asn Gly Asp He Cys He Lys 

115 120 125 

Asp Ser He Asn Thr Leu Trp Thr Thr Pro Ala Pro Ser Pro Asn Cys 

130 135 140 

Arg Leu Asn Ala Glu Lys Asp Ala Lys Leu Thr Leu Val Leu Thr Lys 
145 150 155 160 

Cys Gly Ser Gin He Leu Ala Thr Val Ser Val Leu Ala Val Lys Gly 

165 170 175 

Ser Leu Ala Pro He Ser Gly Thr Val Gin Ser Ala His Leu He He 

180 185 190 

Arg Phe Asp Glu Asn Gly Val Leu Leu Asn Asn Ser Phe Leu Asp Pro 

195 200 205 

Glu Tyr Trp Asn Phe Arg Asn Gly Asp Leu Thr Glu Gly Thr Ala Tyr 

210 215 220 

Thr Asn Ala Val Gly Phe Met Pro Asn Leu Ser Ala Tyr Pro Lys Ser 
225 230 235 240 

His Gly Lys Thr Ala Lys Ser Asn He Val Ser Gin Val Tyr Leu Asn 

245 250 255 

Gly Asp Lys Thr Lys Pro Val Thr Leu Thr He Thr Leu Asn Gly Thr 
260 265 270 
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Gln Glu Thr Gly Asp Thr Thr Pro Ser Ala Tyr Ser Met Ser Phe Ser 

27 5 280 285 

Trp Asp Trp Ser Gly His Asn Tyr He Asn Glu He Phe Ala Thr Ser 

290 295 300 

Ser Tyr Thr Phe Ser Tyr He Ala Gin Glu 
305 310 



